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TO ALL WHOM IT MAY CONCERN: 

Be it known that I, Donald E. Stauton, a citizen of the United 
States of America, residing at 6502 113*'' Avenue, NE, Kirldand, Washington 
98033, have invented a new and useful MATERIALS AND METHODS TO 
MODULATE LIGAND BINDING/ENZYMATIC ACTIVITY OF ap PROTEINS 
CONTAINING AN ALLOSTERIC REGULATORY SITE, of which the following is 
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MATERIALS AND METHODS TO MODULATE LIGAND 
BINDING/ENZYMATIC ACTIVITY OF a/p PROTEINS 
CONTAINING AN ALLOSTERIC REGULATORY SITE 

CROSS REFERENCE TO RELATED APPLICATION 

This application claims the benefit of U.S. Provisional Apphcation 
Serial No. 60/239 J50, filed October 12, 2000. 



FIELD OF THE INVENTION 

The present invention provides materials and methods to regulate 
binding activity of alpha/beta (a/p) molecules comprising an allosteric regulatory site. 

1 5 BACKGROUND OF THE INVENTION 

The alpha/beta (a/p) domain superfamily of proteins includes 
approximately ninety-seven families identified by specific fold structures. Proteins in 
the superfamily generally possess distinctive fold structures such as a TIM barrel, a 
horsehead fold or a beta-alpha-beta structure wherein a central beta sheet is 

20 surrounded by alpha heUces, and is formed from multiple beta strand domains 

arranged in a parallel, anti-parallel or mixed orientation. 

Many members of the superfamily, including proteins comprising an 
integrin I domain, von Willebrand factor comprising A domain structures, and various 
enzymes, have an open twisted beta sheet which gives rise to a fold in the protein's 

25 three dimensional structure. This fold is commonly referred to as a Rossmann fold, a 

Rossmann-like fold, or a dinucleotide binding fold. Many functionally diverse 
proteins contain Rossman folds, and these proteins can be identified using the SCOP, 
SMART, and CATH databases. A prototypic Rossmann fold is found at the site of 
NADP binding in glyceraldehyde-3-phosphate dehydrogenase. 

30 Many Rossmann domains include a functional site on the "upper face" 

of the central beta sheet. This site in, for example, integrin I domains, Rho/Rac 
GTPases, and heterotrimeric GTPases, permits coordinated metal ion binding. In at 
least some integrin I domains, the bound metal ion forms a critical direct contact with 
a bound ligand and this site of metal ion binding has been designated the metal ion 



1 



dependent adhesion site (MIDAS). Metal ion binding sites in other proteins are also 
proximal to ligand binding, including, for example, GTP/GDP binding to GTPases, 
and cofactor (i.e., NAD and FAD) binding to the bacterial protein ENR. Previous 
work has shown that for at least some proteins, including GTPases, LFA-1 [Huth, et 
ai, Proc. NatL Acad. Sci. (USA) 97:5231-5236 (2000)], Mac-1 [Oxvig, et al, Proc, 
Natl. Acad. Sci. (USA) 96:2215-20 (1999)] and Alpha2 [Emsley, et aL, Cell 101:47- 
56 (2000)], ligand binding in the MIDAS region requires a conformation change 
between the active and inactive state of the protein. 

The integrin I domain structure has been characterized in detail. 
Among the integrins in which I domain structures have been identified, primary 
amino acid sequence comparison indicates that overall homology can vary widely 
among different integrin family members. Despite this divergence in homology, 
some residues are highly conserved in many integrins. Further, it has remained 
unclear whether the observed divergence in amino acid sequence homology gives rise 
to substantial differences in tertiary structure of the I domain within the individual 
subunits or the quaternary structure in the heterodimers. 

The I domains for [Lee et aL, Cell 80:631-638 (1995)], [Qu et 
al. Structure 4:931-942 (1996)], a, [Rich, J. Biol. Chem., 274:24906-24913 (1999)], 
and [Emsley et al, J. Biol. Chem., 272:28512-28517 (1997)] have been 
crystallized, thereby permitting detailed analysis of previously speculated functional 
regions. The otj^ crystalline structure clearly identified a Rossmann fold including a 
ligand-binding crevice formed along the top of the central, hydrophobic beta sheet, 
wherein the beta sheet is surrounded by multiple amphipathic a helices [Dickeson, et 
al. Cell. MoL Life. Sci. 54:556-566 (1998)]. Consistent with previous observations, 
crystalline I domains for both and have also been shown to include a MIDAS 
region. 

General structural observations fi*om the crystalline a^^ I domain appear 
to correlate to the crystalline structure of These observations clearly indicate that 
ccl undergoes a conversion fi*om an inactive to an active state before ligand binding 
can occur. This observation has been confirmed in NMR studies wherein ICAM-1 
binding to the aL I domain was shown to require positional perturbations of amino 



acid residues in the a^^ MIDAS region, as well as in a second region, still within the I 
domain but distal to the MIDAS region [Huth, et al, Proc. Natl. Acad. Sci. (USA) 
97:5231-5236(2000)]. 

Site directed mutagenesis in this second region has indicated that 
5 residues therein are not part of the ICAM-1 binding site, i.e,, these residues do not 

interact directly with the ligand, but that these residues do, at least in part, play a role 
in regulating ICAM-1 binding. Amino acid residues that comprise this region have 
been designated the I domain allosteric site (IDAS) [Id.], and it is postulated that this 
region undergoes and/or induces a functionally relevant conformational shift that may 

10 be modulated by a small molecule. If the overall tertiary structure is conserved in the 

I or A domains of other proteins, such a site could provide an attractive target for 
modulating ligand binding for these proteins. 

Furthermore, the crystal structure of the entire extracellular region of 
alphaVbeta3, an integrin, was recently reported [Cousin, Science, 293:1743-1746 

15 (September 7, 2001)]. The crystal structure confirms predictions that the beta subunit 

of all integrins contains an I domain. Because this I domain has been implicated in 
regulating integrin function, it is an additional potential site for modulating ligand 
binding for these proteins. Identification of such regulatory regions provides means 
by which modulators, te., agonists and antagonists, of ligand binding can be 

20 identified. Identification of such modulators provides candidate compounds that can 
provide protection against, and relief fi:om, the myriad of pathological states 
associated with aberrant activity of cc/p proteins. 

Accordingly, there exists a need in the art to identify modes of 
modulating a/p proteins, which have a wide variety of fimctions and primary 

25 structures, in such a manner as to influence their biological activity. 

SUMMARY OF THE INVENTION 

In one aspect, the present invention provides methods of modulating 
binding interaction between a first molecule which is not LFA-1 or an I domain- 
30 containing fi^agment thereof, and a binding partner molecule, said first molecule 

comprising an a/p domain structure, said a/p structure comprising an allosteric 
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regulatory site, said method comprising the step of contacting said first molecule with 
an allosteric effector molecule that interacts with said allosteric regulatory site and 
promotes a conformation in a ligand binding domain of said a/|3 structure that 
modulates binding between said first molecule and said binding partner molecule. As 
5 used herein, the term "a/(3 structure'* for a molecule refers to a general class of 

molecules that comprise a characteristic structure which is not necessarily indicative 
of, for example, molecules having multiple subunits which are designates as a and P 
subunits. This general class of molecules, however, can include molecules having 
multiple subunits which are designates as a and P subunits. The invention further 

1 0 provides methods of modulating binding interaction between a first molecule which is 
not LFA-1 or an I domain-containing fi:-agment thereof, and a binding partner 
molecule, said first molecule comprising an a/p domain structure, said a/p structure 
comprising an allosteric regulatory site, said method comprising the step of contacting 
said first molecule with an allosteric effector molecule, said allosteric effector 

15 molecule comprising a diaryl compound, said diaryl compound interacting with said 

allosteric regulatory site and promoting a conformation in a ligand binding domain of 
said (x/p structure that modulates binding between said first molecule and said binding 
partner molecule. In another aspect, the invention provides methods of modulating 
binding interaction between a first molecule which is not LFA-1 or an I domain- 

20 containing firagment thereof, and a binding partner molecule, said first molecule 

comprising an a/p domain structure, said a/p structure comprising an allosteric 
regulatory site, said method comprising the step of contacting said first molecule with 
an allosteric effector molecule, said allosteric effector molecule selected fi-om the 
group consisting of diaryl sulfide compounds and diarylamide compounds, said 

25 allosteric effector molecule interacting with said allosteric regulatory site and 

promoting a conformation in a ligand binding domain of said a/p structure that 
modulates binding between said first molecule and said binding partner molecule. 

In one embodiment, methods of the invention utihze a first molecule 
which comprises a Rossmann fold structure, said Rossmann fold structure comprising 

30 said allosteric regulatory site. As used herein, the term Rossmann fold structure 

encompasses Rossmann-like fold structures and dinucleotide fold structures, as is 
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known in the art. In the methods the Rossmann fold structure in the first molecule 
comprises a P sheet having p sheet strands positioned in a 321456 or 231456 
orientation. Alternatively, the Rossmann fold structure in the first molecule 
comprises a p sheet having p sheet strands positioned in a 3214567 orientation. In 
5 another aspect, the Rossmann fold structure in said first molecule comprises a P sheet 

having p sheet strands positioned in a 32145 orientation. As used herein, the term 
orientation refers to the positioning of the individual strands of a p sheet in a parallel, 
antiparallel or mixed configuration. Preferably, methods employ a first molecule 
which comprises an I domain structure or an A domain structure. 

1 0 * The invention further provides methods of modulating binding 

interaction between a first molecule and a binding partner molecule, said first 
molecule having an amino acid sequence which exhibits less than about 90% identity 
to the LFA-1 1 domain amino acid sequence set out in FIGURE 1, said first molecule 
comprising an a/p structure, said a/p domain structure comprising an allosteric 

15 regulatory site, said method comprising the step of contacting said first molecule with 

an allosteric effector molecule that interacts with said allosteric regulatory site and 
promotes a conformation in a ligand binding domain of said a/p structure that 
modulates binding between said first molecule and said binding partner molecule. 
The allosteric regulatory sites of the present invention include "I-like domains" or 

20 "IDAS-like domains," as well as IDAS domains. As used herein, the terms I-like 

domains and DDAS-like domains refer to regulatory sites discrete (i.e., 
distinguishable) firom the MIDAS region (in MIDAS-containing molecules), and 
discrete (i.e., distinguishable) fi-om ligand, substrate or co-factor binding sites, that do 
not necessarily include a complete I domain per se, but do undergo and/or induce a 

25 functionally relevant conformational shift that may be modulated by a small molecule 

to increase or decrease binding between a first molecule and a binding partner 
molecule. In another aspect, the invention provides methods of modulating binding 
interaction between a first molecule and a binding partner molecule, said first 
molecule having an amino acid sequence which exhibits less than about 90% identity 

30 to the LFA-1 1 domain amino acid sequence set out in FIGURE 1, said first molecule 

comprising an a/p structure, said a/p domain structure comprising an allosteric 
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regulatory site, said method comprising the step of contacting said first molecule with 
an allosteric effector molecule, said allosteric effector molecule comprising a diaryl 
compound, said diaryl compound interacting with said allosteric regulatory site and 
promoting a conformation in a ligand binding domain of said a/p structure that 

5 modulates binding between said first molecule and said binding partner molecule. In 

still another aspect, the invention provides methods of modulating binding interaction 
between a first molecule and a binding partner molecule, said first molecule having an 
amino acid sequence which exhibits less than about 90% identity to the LFA-1 I 
domain amino acid sequence set out in FIGURE 1 , said first molecule comprising an 

10 a/p domain structure, said cc/p structure comprising an allosteric regulatory site, said 

method comprising the step of contacting said first molecule with an allosteric 
effector molecule, said allosteric effector molecule selected fi*om the group consisting 
of diaryl sulfide compounds and diarylamide compounds, said allosteric effector 
molecule interacting with said allosteric regulatory site and promoting a conformation 

15 in a ligand binding domain of said a/p structure that modulates binding between said 

first molecule and said binding partner molecule. In a preferred embodiment, each of 
the methods the first molecule has an amino acid sequence that exhibits a percent 
identity with respect to the LFA-1 1 domain amino acid sequence less than about 40%, 
about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, 

20 about 80%, about 85%), or about 90%, In another aspect, the first molecule comprises 

a Rossmann fold structure, said Rossmann fold structure comprising an allosteric 
regulatory site and the first molecule has an amino acid sequence that exhibits a 
percent identity with respect to the LFA-1 1 domain amino acid sequence less than 
about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, 

25 about 75%, about 80%, about 85%>, or about 90%. In another aspect, the methods of 

the invention utilize a first molecule wherein the Rossmann fold structure in said first 
molecule comprises a p sheet having p sheet strands positioned in a 321456 or 231456 
orientation and the first molecule has an amino acid sequence that exhibits a percent 
identity with respect to the LFA-1 1 domain amino acid sequence less than about 40%, 

30 about 45%, about 50%, about 55%, about 60%, about 65%, about 10%, about 75%, 

about 80%), about 85%, or about 90%. In another aspect, the methods use a protein 
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wherein the Rossmann fold structure in said first molecule comprises a P sheet having 
p sheet strands positioned in a 3214567 orientation and the first molecule has an 
amino acid sequence that exhibits a percent identity with respect to the LFA-1 I 
domain amino acid sequence less than about 40%, about 45%, about 50%), about 55%, 
5 about 60%, about 65%, about 70%, about 75%>, about 80%, about 85%, or about 90%. 

In another aspect, the method utihze a first molecule with a Rossmann fold structure 
comprising a P sheet having p sheets strands positioned in a 32145 orientation, and 
the first molecule has an amino acid sequence that exhibits a percent identity with 
respect to the LFA-1 I domain amino acid sequence less than about 40%, about 45%, 

10 about 50%, about 55%, about 60%^, about 65%, about 70%., about 75%, about 80%, 

85%o, or about 90%. Preferably, the first molecule comprises an I domain structure 
and the first molecule has an amino acid sequence that exhibits a percent identity with 
respect to the LFA-1 I domain amino acid sequence less than about 40%, about 45%, 
about 50%), about 55%, about 60%, about 65%., about 70%, about 75%), about 80%, 

15 about 85%,or about 90%, In another preferred embodiment, the first molecule 

comprises an A domain structure and the first molecule has an amino acid sequence 
that exhibits a percent identity with respect to the LFA-1 1 domain amino acid 
sequence less than about 40%, about 45%), about 50%, about 55%, about 60%, about 
65%, about 70%), about 75%, about 80%), about 85%, or about 90%. 

20 In methods of the invention, the modulator promotes a conformation in 

the ligand binding domain of said first molecule that increases binding between said 
first molecule and said binding partner molecule, and in one aspect, the increase in 
binding between the first molecule and the second molecule results in increased 
enzymatic activity of the first molecule. In another embodiment, the modulator 

25 promotes a conformation in the ligand binding domain of said first molecule that 

decreases binding between said first molecule and said binding partner molecule and 
the decrease in binding between the first molecule and the second molecule results in 
decreased enzymatic activity of the first molecule. 

Methods include use of a first molecule selected fi-om the group 

30 consisting of the proteins set forth in Table 1 as well as other proteins which comprise 

I or A domains, G proteins, heterotrimeric G proteins, and tubulin GTPase. 
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Preferably, methods of the invention utilize a first molecule selected from the group 
consisting of the proteins set forth in Table 1 . In one aspect, the first molecules is a 
eukaryotic molecule. Preferably, the first molecule is a human molecule. Li another 
aspect, the first molecule is a prokaryotic molecule, hi one embodiment, the first 
5 molecule is a bacterial molecule. 

More preferably, the first molecule is selected fi*om the group 
consisting of aj^Pj, complement protein C2, complement protein Factor B, a^Py, 0£4P7, 
cCyP3, a^^^, aj^27^^^ Willebrand factor, Rac-1, HPPK, ftsZ, and ENR. hi methods 
wherein the first molecule is aj^P2 ^^e binding partner protein is fibrinogen; the 

10 first molecule is a^^2 the binding partner protein is iC3b; the first molecule is 

a^py and the binding partner protein is E-cadherin; the first molecule is a^^j and the 
binding partner protein is MadCAM-1; the first molecule is ayPa and the binding 
partner protein is vitronectin; the first molecule is a^^^ and the binding partner protein 
is VCAM; the first molecule is aj^2 binding partner protein is VCAM; the 

1 5 first molecule is von Willebrand factor and the binding partner protein is gplb; the 

first molecule is complement protein C2 and the binding partner protein is 
complement protein C4b; the first molecule is complement protein Factor B and the 
binding partner protein is complement protein C3b; the first molecule is Rac-1 and the 
binding partner is GTP; the first molecule is HPPK and the binding partner is ATP or 

20 HMDP; the first molecule is ftsZ and the binding partner is GTP; and the first 

molecule is ENR and the binding partner is NADH. 

DETAILED DESCRIPTION OF THE INVENTION 

In one aspect, the present invention provides methods of modulating 
25 binding interaction between a first molecule vs^hich is not LFA-1 or an I domain- 

containing fi-agment or mimetics thereof, and a binding partner molecule, said first 
molecule comprising an a/p structure^ said a/p structure comprising an allosteric 
regulatory site, said method comprising the step of contacting said first molecule with 
an allosteric effector molecule that interacts with said allosteric regulatory site and 
30 promotes a conformation in a ligand binding domain of said a/p structure that 

modulates binding between said first molecule and said binding partner molecule. As 
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used herein, "binding partner molecules" includes ligands, substrates and cofactor, the 
binding of which is required to effect one or more biological activity of the first 
molecule. An I domain fragment of LFA-1 is a polypeptide portion or fragment {i.e., 
a polypeptide that is less than full length LFA-1 as set out in FIGURE 2) of LFA-1 
5 that comprises (i) the I domain of LFA-1 , or (ii) a portion of the LFA-1 I domain that 

maintains biologically active features of the LFA-1 I domain. Synthetic mimetics of 
the LFA-1 I domain, including peptidomimetics which replicate or affect one or more 
biological activities of the LFA-1 I domain, are also included in this definition. The 
a/p superfamily of proteins includes those proteins having an beta-alpha-beta 

10 structure wherein a central beta sheet domain is flanked on both sides of the sheet by 

one or more alpha helix domains. 

In another aspect, the present invention provides methods of 
modulating binding interaction between a first molecule which is not LFA-1 or an I 
domain-containing firagment or mimetics thereof, and a binding partner molecule, said 

15 first molecule comprising a Rossmaim fold structure, said Rossmann fold structure 

comprising an allosteric regulatory site, said method comprising the step of contacting 
said first molecule with an allosteric effector molecule that interacts with said 
allosteric regulatory site and promotes a conformation in a ligand binding domain of 
said Rossmaim fold structure that modulates binding between said first molecule and 

20 said binding partner molecule. A Rossmann fold structure in a protein compnses a 

beta sheet structure wherein individual beta sheet domains of the protein are 
positioned in either parallel, antiparallel, or mixed orientations. In preferred aspects 
of the present invention, the beta sheet of the first molecule is comprised of individual 
beta sheet strands. Nimierical designations for the individual beta sheet strands are 

25 assigned according to their position in the primary amino acid sequence of the first 

protein, with the first beta sheet strand being that one closest to the amino terminus of 
the protein sequence. Rossmann fold structures are further characterized by the 
presence of a ligand binding fold, pocket, or site in the three dimensional structure of 
the beta sheet that is generally positioned at the "top" of the beta sheet structure. 

30 In another aspect, the present invention provides methods of 

modulating binding interaction between a first molecule which is not LFA-1 or an I 
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domain-containing fragment or mimetic thereof, and a binding partner molecule, said 
first molecule comprising a Rossmann fold structure, said Rossmann fold structure 
comprising a p sheet having p strands positioned in a 321456 or 231456 orientation 
and an allosteric regulatory site, said method comprising the step of contacting said 
5 first molecule with an allosteric effector molecule that interacts with said allosteric 

regulatory site and promotes a conformation in a ligand binding domain of said 
Rossmann fold structure that modulates binding between said first molecule and said 
binding partner molecule, hi another aspect, the present invention provides methods 
of modulating binding interaction between a first molecule which is not LFA-1 or an I 

10 domain-containing fi*agment or mimetic thereof, and a binding partner molecule, said 

first molecule comprising a Rossmann fold structure, said Rossmann fold structure 
comprising a p sheet having p strands positioned in a 3214567 orientation and an 
allosteric regulatory site, said method comprising the step of contacting said first 
molecule with an allosteric effector molecule that interacts with said allosteric 

1 5 regulatory site and promotes a conformation in a ligand binding domain of said 

Rossmann fold structure that modulates binding between said first molecule and said 
binding partner molecule. The present invention also provides methods of modulating 
binding interaction between a first molecule which is not LFA-1 or an I domain- 
containing fragment or mimetic thereof, and a binding partner molecule, said first 

20 molecule comprising a Rossmann fold structure, said Rossmann fold structure 
comprising a p sheet having p strands positioned in a 32145 orientation and an 
allosteric regulatory site, said method comprising the step of contacting said first 
molecule with an allosteric effector molecule that interacts with said allosteric 
regulatory site and promotes a conformation in a ligand binding domain of said 

25 Rossmann fold structure that modulates binding between said first molecule and said 

binding partner molecule. Numerical designations for individual beta sheets in the 
first molecule are as described above. 

hi another aspect, the present invention provides methods of 
modulating binding interaction between a first molecule which is not LFA-1 or an I 

30 domain-containing fi-agment or mimetic thereof, and a binding partner molecule, said 

first molecule comprising an I domain structure, said I domain structure comprising 
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an allosteric regulatory site, said method comprising the step of contacting said first 
molecule with an allosteric effector molecule that interacts with said allosteric 
regulatory site and promotes a conformation in a ligand binding domain of said 1 
domain structure that modulates binding between said first molecule and said binding 
5 partner molecule. I domain structures are known in the art to comprise approximately 

200 amino acids as exemplified by the domains identified in a number of integrins 
[See Dickeson, et al. Cell. MoL Life Sci. 54:556-566 (1998)]. 

The present invention also provides methods of modulating binding 
interaction between a first molecule which is not LFA-1 or an I domam-containing 

10 fi:*agment thereof, and a binding partner molecule, said first molecule comprising an A 
domain structure, said A domain structure comprising an allosteric regulatory site, 
said method comprising the step of contacting said first molecule with an allosteric 
effector molecule that interacts with said allosteric regulatory site and promotes a 
conformation in a ligand binding domain of said A domain structure that modulates 

15 binding between said first molecule and said binding partner molecule. A domain 

motifs are known in the art to share homology with I domains and are exemplified by 
the domains found in von Willebrand factor. 

The present invention also provides methods of modulating binding 
interaction between a first molecule and a binding partner molecule, said first 

20 molecule having an amino acid sequence which exhibits less than about 90% identity 

to the LFA-1 I domain amino acid sequence [set out in FIGURE 1], said first 
molecule comprising an a/p structure, said a/p structure comprising an allosteric 
regulatory site, said method comprising the step of contacting said first molecule with 
an allosteric effector molecule that interacts with said allosteric regulatory site and 

25 promotes a conformation in a ligand binding domain of said a/p structure that 

modulates binding between said first molecule and said binding partner molecule. 
Identity as used herein can be calculated using basic BLAST analysis using default 
parameters. Values for percent identity reflect one-to-one correspondence between 
amino acid residues across the entire LFA-1 sequence I domain as set out in FIGURE 

30 1 and a region of amino acid residues of the same or similar length in the first 

molecule. In another embodiment of the method, the first molecule has an amino acid 
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sequence that exhibits a percent identity with respect to the LFA-1 I domain amino 
acid sequence of less than about 40%, about 45%, about 50%, about 55%o, about 
60%, about 65%>, about 70%, about 75%, about 80%, about 85%, or about 90%. 

In still another aspect, the present invention provides methods of 
5 modulating binding interaction between a first molecule and a binding partner 

molecule, said first molecule having an amino acid sequence which exhibits less than 
about 90% identity to the LFA-1 I domain amino acid sequence [set out in FIGURE 
1], said first molecule comprising a Rossmann fold structure, said Rossmann fold 
structure comprising an allosteric regulatory site, said method comprising the step of 

10 contacting said first molecule with an allosteric effector molecule that interacts with 

said allosteric regulatory site and promotes a conformation in a ligand binding domain 
of said Rossmann fold structure that modulates binding between said first molecule 
and said binding partner molecule. In alternative embodiments of the method, the 
first molecule has an amino acid sequence that exhibits a percent identity with respect 

15 to the LFA-1 1 domain amino acid sequence of less than about 40%, about 45%>, 

about 50%, about 55%>, about 60%), about 65Vo, about 70%, about 15%, about 80%>, 
about 85%>, or about 90%. 

In another aspect, the present invention provides methods of 
modulating binding interaction between a first molecule and a binding partner 

20 molecule, said first molecule having an amino acid sequence which exhibits less than 

about 90% identity to the LFA-1 1 domain amino acid sequence [set out in FIGURE 
1], said first molecule comprising a Rossmann fold structure with p sheets strands 
positioned in a 321456 or 231456 orientation and an allosteric regulatory site, said 
method comprising the step of contacting said first molecule with an allosteric 

25 effector molecule that interacts with said allosteric regulatory site and promotes a 

conformation in a ligand binding domain of said Rossmann fold structure that 
modulates binding between said first molecule and said binding partner molecule. In 
alternative embodiments of the method, the first molecule has an amino acid sequence 
that exhibits a percent identity with respect to the LFA-1 1 domain amino acid 

30 sequence of less than about 40%>, about 45%, about 50%, about 55%, about 60%, 

about 65%o, about 70%o, about 75%», about 80%, about 85%>, or about 90%. 
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In still another aspect, the present invention provides methods of 
modulating binding interaction between a first molecule and a binding partner 
molecule, said first molecule having an amino acid sequence which exhibits less than 
about 90% identity to the LFA-1 1 domain amino acid sequence [set out in FIGURE 
5 1], said first molecule comprising a Rossmann fold structure^ said Rossmann fold 

structure with P sheet strands positioned in a 3214567 orientation and an allosteric 
regulatory site, said method comprising the step of contacting said first molecule with 
an allosteric effector molecule that interacts with said allosteric regulatory site and 
promotes a conformation in a ligand binding domain of said Rossmann fold structure 

10 that modulates binding between said first molecule and said binding partner molecule. 

In alternative embodiments of the method, the first molecule has an amino acid 
sequence that exhibits a percent identity with respect to the LFA-1 1 domain amino 
acid sequence of less than about 40%, about 45%, about 50%, about 55%, about 
60%, about 65%, about 70%, about 75%, about 80%, about 85%, or about 90%, 

1 5 The present invention also provides methods of modulating binding 

interaction between a first molecule and a binding partner molecule, said first 
molecule having an amino acid sequence which exhibits less than about 90% identity 
to the LFA-1 1 domain amino acid sequence [set out in FIGURE 1], said first 
molecule comprising a Rossmann fold structure p sheet strands positioned in a 32145 

20 orientation and an allosteric regulatory site, said method comprising the step of 

contacting said first molecule with an allosteric effector molecule that interacts with 
said allosteric regulatory site and promotes a conformation in a ligand binding domain 
of said Rossmann fold structure that modulates binding between said first molecule 
and said binding partner molecule. In alternative embodiments of the method, the 

25 first molecule has an amino acid sequence that exhibits a percent identity with respect 

to the LFA-1 I domain amino acid sequence of less than about 40%, about 45%, 
about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, 
about 85%, or about 90%. 

The present invention further provides methods of modulating binding 

30 interaction between a first molecule and a binding partner molecule, said first 

molecule having an amino acid sequence which exhibits less than about 90% identity 
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to the LFA-1 I domain amino acid sequence [set out in FIGURE 1], said first 
molecule comprising an I domain structure, said I domain structure comprising an 
allosteric regulatory site, said method comprising the step of contacting said first 
molecule with an allosteric effector molecule that interacts with said allosteric 
5 regulatory site and promotes a conformation in a ligand binding domain of said I 

domain structure that modulates binding between said first molecule and said binding 
partner molecule. In alternative embodiments of the method, the first molecule has an 
amino acid sequence that exhibits a percent identity with respect to the LFA-1 I 
domain amino acid sequence of less than about 40%, about 45%, about 50%, about 
10 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, or about 

90%. 

In another aspect, the present invention provides methods of 
modulating binding interaction between a first molecule and a binding partner 
molecule, said first molecule having an amino acid sequence which exhibits less than 

15 about 90% identity to the LFA-1 1 domain amino acid sequence [set out in FIGURE 

1], said first molecule comprising an A domain structure, said A domain structure 
comprising an allosteric regulatory site, said method comprising the step of contacting 
said first molecule with an allosteric effector molecule that interacts with said 
allosteric regulatory site and promotes a conformation in a ligand binding domain of 

20 said A domain structure that modulates binding between said first molecule and said 

binding partner molecule. In alternative embodiments of the method, the first 
molecule has an amino acid sequence that exhibits a percent identity with respect to 
the LFA-1 1 domain amino acid sequence of less than about 40%, about 45%, about 
50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 

25 85%, or about 90%. 

In each of the methods of the present invention, the modulator 
promotes a conformation in the ligand binding domain of said first molecule that 
increases binding between said first molecule and said binding partner molecule. 
Alternatively, the modulator promotes a conformation in the ligand binding domain of 

30 said first molecule that decreases binding between said first molecule and said binding 

partner molecule. Preferably, the methods include a first molecule selected from the 
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group consisting of the molecules set out in Table 1 or otherwise described herein. 
Most preferably, methods utilize a first molecule selected from the group consisting of 
oif^p2' complement protein C2, complement protein Factor B, a^Py, ccyPj, 
^A^u^^i Willebrand factor, Rac-1, HPPK, ftsZ, and ENR, Furthermore, 
5 preferably, the methods and compositions of the present invention use a modulator 

that is a diaryl compound. More preferably, the methods and compositions of the 
present invention use a modulator that is selected from diaryl sulfide compounds and 
diarylamide compounds. Most preferably, the methods and compositions of the 
present invention use a modulator that is a diaryl sulfide compound. 

10 hi methods wherein the first molecule is ^^^2^ the preferred binding 

partner protein is fibrinogen, and a preferred modulator is selected from the group 
consisting of Cmpd S, Cmpd R, Cmpd N, Cmpd O, Cmpd P, Cmpd Q, Cmpd L, 
Cmpd V, Cmpd F, Cmpd AA, and Cmpd AC as set out in Table 2. hi methods 
wherein the first molecule is o:j^p2, an alternative preferred binding partner protein is 

1 5 iC3b and a preferred modulator is selected fi*om the group consisting of Cmpd H, 

Cmpd I and Cmpd C. hi methods wherein the first molecule is a^^-j, the preferred 
binding partner protein is E-cadherin and a preferred modulator is selected from the 
compounds set out in Table 2 herein, hi methods wherein the first molecule is a^%^ 
the preferred binding partner protein is MAdCAM-L hi methods wherein the first 

20 molecule is ctyP^, the preferred binding partner protein is vitronectin, hi methods 

wherein the first molecule is a4Pi, the preferred binding partner protein is VCAM. hi 
methods wherein the first molecule is a^p2, the preferred binding partner protein is 
VCAM. hi methods wherein the first molecule is von Willebrand factor, the preferred 
binding partner protein is gplb. hi methods wherein the first molecule is complement 

25 protein C2 , the preferred binding partner protein is complement protein C4b. hi 

methods wherein the first molecule is complement protein Factor B, the preferred 
binding partner protein is complement protein C3b. hi methods wherein the first 
molecule is either ajPi, oc2Pi> <^uPi* preferred binding partner is collagen, hi 
methods wherein the first molecule is a2pij the preferred binding partner is collagen 

30 and a preferred modulator is selected from the group of compounds set out in Table 2 

herein, hi methods wherein the first molecule is Rac-1, the preferred binding partner 



15 



is GDP/GTP and a preferred modulator GTP. In methods wherein the first molecule 
is HPPK, the preferred binding partner is ATP or HMDP. hi methods wherein the 
first molecule is flsZ, the preferred binding partner is GTP. In methods wherein the 
first molecule is ENR, the preferred binding partner is NADH. 
5 Methods of the present invention include those wherein the first 

molecule, the binding partner molecule or both are isolated proteins, or binding 
fragments thereof, obtained from natural sources or fi*om cells modified to express the 
molecules as heterologous proteins. The methods also embrace use of the first 
molecule, or a binding fragment thereof, the binding partner molecule, or a binding 
1 0 fi*agment thereof, both which are expressed on the surface of cells which express the 

molecules as homologous proteins or on the surface of cells which have been 
modified to express heterologous proteins. In vivo and in vitro methods are 
contemplated. 

In vivo methods are expected to alleviate and/or prevent pathological 

15 states which arise firom aberrant binding activity between the first molecule and the 

binding partner molecule. For example, indications associated with inappropriate 
complement activation for which methods of the present invention are expected to 
alleviate or prevent include: (i) diseases involving antibody/complement deposition 
which includes systemic lupus erythematosus (SLB), Goodpasture's disease, 

20 rheumatoid arthritis, myasthenia gravis, autoimmune hemolytic anemia, autoimmune 

thrombocytopenic purpura, and Rasmussen's encephalitis; (ii) diseases involving 
ischemia-reperfiision injmy, including stroke, myocardial infarction, cardiac 
pulmonary bypass, acute hypovolemic disease, renal failure, and allotransplantation; 
(iii) central nervous system pathologies such as Alzheimer's disease and multiple 

25 sclerosis; and (iv) miscellaneous indications such as trauma, chemical or thermal 

injury, and xenotransplantation. 

Likewise, inhibitors of alpha 1, alpha 2, and alpha 1 1 are also expected 
to be useful for treating cancer. During metastasis, tumor cells must pass through the 
extracellular matrix prior to intravasation and following extravasation. Migration 

30 through these regions is dependent on integrin activity. In addition, it has been shown 

that blocking of a j or 0.2 activity with monoclonal antibodies [Locher et al^ Mol, Biol. 
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CelL 10:271-282 (1999)] or removal of aj activity in a knockout mouse [Pozzi, et al, 
Proc. NatL Acad. Sci. (USA) 97:2202-2207 (2000)] results in changes in matrix 
metalloproteinase (MMP) levels. MMPs are extracellular matrix-degrading enzymes 
which have been proposed to play a role in a variety of types of cancer, [For a review, 
5 see Nelson, et. al., J. Clm. Oncol. 18:1 135-1 149 (2000)]. Inhibitors of MMPs are 

currently being tested for clinical utility in treating many types of cancer. To date, 
MMP inhibitors have not been as effective in human trials as in animal models. 
Modulating MMP expression by inhibiting integrin activity can prove to be more 
effective by differentially modulating different MMP levels and by specifically 

1 0 targeting this MMP modulation to a,, or a^^ expressing cells. 

More particularly, it has been demonstrated that alpha 1 1 is expressed 
on foamy macrophages in atherosclerotic plaques as well as in a subset of 
macrophages in synovium from a patient with rheumatoid arthritis. No expression has 
been seen in non-activated monocyte derived macrophages, hihibitors of alpha 

15 1 1/ligand binding interactions could therefore be useful for reducing migration and/or 

signaling events of macrophages that are associated with different inflammatory 
processes. Accordingly, alpha 1 1 inhibitors could represent useful therapeutics for 
treating inflammatory diseases, including atherosclerosis and rheumatoid arthritis. 

Similarly, alpha 1 and alpha 2 integrins have been shown to be 

20 upregulated on certain cells (including T cells and monocytes) following stimulatioil. 

It has also been demonstrated that blocking interactions between alpha 1 or alpha 2 
and their ligands using monoclonal antibodies inhibited inflammatory responses in 
mouse models of delayed-type hypersensitivity, contact hypersensitivity and arthritis 
[deFougeroUes e?. al. J, Clin, Invest 705:721-729(2000)]. Antagonists of alpha 1 and 

25 alpha 2 may inhibit inflammation through a variety of mechanisms including 

inhibiting cell migration, cell proliferation and the production of inflammatory 
mediators such as matrix metalloproteinase 3, tumor necrosis factor alpha and 
interleukin-1. Accordingly, small molecule inhibitors or antagonists of alphal and 
alpha2 associations (ligand binding), i.e.^ allosteric effector molecules, could be useful 

30 for the treatment of inflammatory diseases such as arthritis, fibrotic diseases and 

cancer. 
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Fibrotic disease states are characterized by the excessive production of 
fibrous extracellular matrix by certain cell types that are inappropriately activated. It 
is believed that the mechanism of fibrous extracellular matrix formation involves^ at 
least in part, a/p protein activity. Accordingly, by inhibiting a/(i proteins, the present 
5 invention provides methods and compositions for the treatment and prevention of 

various fibrotic disease states, including scleroderma (morphea, generalized morphea, 
linear scleroderma), keloids, hypertrophic scar, nodular fasciitis, eosinophihc fasciitis, 
Dupuytren's contracture, kidney fibrosis, pulmonary fibrosis, chemotherapy / radiation 
induced lung fibrosis, atherosclerotic plaques, inflammatory bowel disease, Crohn's 

1 0 disease, arthritic joints, invasive breast carcinoma desmosplasis, dermatofibromas, 

endothelial cell expression, angiolipoma, angioleiomyoma, sarcoidosis, cirrhosis, 
idiopathic interstitial lung disease, idiopathic pulmonary fibrosis (4 pathologic types), 
collagen vascular disease associated lung syndromes, cryptogenic organizing 
pneumonia, Goodpasture's syndrome, Wegener's granulomatosis, eosinophilic 

15 granuloma, iatrogenic lung disease, pneumoconioses (asbestosis, silicosis), 

hypersensitivity pneumonitides (farmer's lung, bird fancier's limg, etc.), interstitial 
pulmonary fibrosis, chemical pneumonitis, hypersensitivity pneumonitis and the like. 

With respect to bacterial proteins, ENR is ab*eady a target for anti- 
tuberculosis drugs and a target of the broad spectrum biocide triclosan. Small 

20 molecules v/ould therefore be usefiil in drug resistant tuberculosis. Moreover, the 

activity spectrum of ENR and DapB inhibitors would be usefiil as Gram negative 
inhibitors. Furthermore, because ERA-GTPase is highly conserved among bacteria, 
inhibitors would be usefiil against a broad spectrum of bacteria, depending on 
permeability. In addition, inhibitors of the various bacterial proteins would be usefiil 

25 for treating bacterial diseases involving Gram negative bacteria and infections with 

undefined bacterial pathogens. 

Other chemotherapeutics, such as sulfonamides, inhibit bacterial 
growth by antagonizing the de novo folate biosynthetic pathway [Mandell and Petri, 
Sulfonamides, Trimethoprim-sulfamethoxazole, Quinolones, and Agents for Urinary 

30 Tract Infections^ in The Pharmacological Basis of Therapeutics (Goodman and 

Gilman eds., 1996)]. The primary goal of anti-folate therapy is to deplete the 
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intracellular pools of reduced folate, resulting in the inhibition of DNA replication 
due to insufficient levels of thymidine [Hitchings and Baccanari, Design and 
Synthesis of Folate Antagonists as Antimicrobial Agents, in Folate Antagonists as 
Therapeutic Agents (1984)], 
5 The enzyme 6-hydroxymethyI-7,8-dihydropterin pyrophosphokinase 

(HPPK) catalyzes the transfer of pyrophosphate from ATP to 6-hydroxy-7,8- 
dihydropterin (HMDP) in the de novo folate biosynthetic pathway [Richey and 
Brown, J. Biol Chem., 244:1582-1592 (1969)]. HPPK is expressed in both Gram 
positive and Gram negative bacteria, fungi, and protozoa, but not in higher 
10 eukaryotes, and represents an important target for the development of antibiotics with 

anti -folate activity. By inhibiting HPPK, the present invention can provide methods 
and compositions for the treatment and prevention of various bacterial and fungal 
infections. 

FtsZ is the product of an essential bacterial gene that is involved in cell 
15 division, FtsZ binds and hydrolyzes GTP, and when bound to GTP it forms long, 

linear polymers. The GTP-dependent polymerization of ftsZ is related to its function 
in bacterial cell division. During septation, ftsZ forms a ring to define the plane of 
cell division. Ceils lacking ftsZ can not imdergo septation, do not divide and die. 
FtsZ is highly conserved (approximately 60%) throughout the bacterial kingdom. 
20 Accordingly, by inhibiting flsZ, the compositions and methods of the present 

invention provide broad-spectrum antibiotics. The atomic structure of flsZ shows that 
it is an alpha/beta protein [Nogales etal, (1998) Nature Structural Biology 5:451- 
458]. 

Modulators of vWF binding are useful in treatment of thrombotic 
25 vascular diseases, such as myocardial infarction (MI) and thrombotic stroke. Acute 

administration of a vWF Al -domain binding antagonist can reduce the risk of 
coronary vascular occlusion in high risk patients such as those with unstable angina, 
or following PTCA or stent placement. Several gpIIb/IIIa antagonists have recently 
been approved for clinical use in these settings (ReoPro®, Itrafiban, sibrafiban). 
30 While these agents are effective, their use is accompanied by bleeding, thus limiting 

their effective dose. If the bleeding side effects of an Al -domain inhibitor are limited. 
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it can be used chronically in individuals at risk for vascular occlusion. These 
individuals include patients with angina, claudication, and those with a history of MI 
or stroke. Abnormalities of vWF metabolism are the cause of the occlusive thrombus 
in thrombotic thrombocytopenic purpura, suggesting Al domain inhibitors may also 
5 be useful in this setting. 

Racl, Rac2 and Rac3 are members of the Ras superfamily of small 
molecular weight (approximately 22-25kDa) GTPases, many of which are a/p 
proteins [Edwards and Perkins, FEBS Lett 358:283 (1995); De Vos et al. Science 
239:888 (1988); Worthylake et al. Nature 408:682 (2000)]. Primary amino acid 

10 sequence comparison indicates that the overall homology of the Rac proteins is about 

88 to about 92 percent identical. It is known that Racl and Rac2 proteins play a 
crucial role in cell survival, proliferation, metastasis and reactive oxygen species 
(ROS) production [Symons, Curr, Opin, in Biotech., 6:668 (1995); and, Scita, EMBO 
J., 19(1 1):2393 (2000)]. Due to the importance of Rac proteins in the control of cell 

15 proliferation, antagonists of the Rac guanine nucleotide exchange reaction and, in 

particular, small molecules that interfere with the exchange of GDP for OTP of Racl 
in the presence of Tiaml, are of considerable interest for the methods and 
compositions of the present invention. 

In view of the indications described above, the present invention 

20 further provides methods for alleviating or preventing a condition arising from 

aberrant binding between a first molecule that is not LFA-1 or an I domain fragment 
thereof and a binding partner molecule, wherein said first molecule is an a/p protein 
selected from the group of proteins set forth in Table 1 , said method comprising the 
steps of administering to an individual in need thereof an effective amount of a 

25 modulator of binding between said first molecule and said binding partner molecule. 

As used herein, the term effective amount refers to the administration of an amount of 
a modulator sufficient to achieve its intended purpose. More specifically, a 
"therapeutically effective amount" refers to an amount effective to treat or to prevent 
development of, or to alleviate the existing symptoms of, the subject being treated. 

30 Determination of the effective amounts is well within the capability of those skilled in 

the art, especially in light of the detailed disclosure provided herein. 
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In one aspect, the present invention provides methods of treatment 
wherein the oc/p protein comprises a Rossmaim fold. In another aspect, methods of 
treatment are provided wherein the Rossmann fold in the targeted protein includes 
five, six or seven p strands which makeup the central p sheet structure. When the 
5 Rossmann fold comprises five p strands, it is preferred that the positioning of the 

individual strands is 32145 as defined above. When the Rossmann fold comprises six 
p strands, it is preferred that the positioning of the individual strands is 321456 or 
231456 as defined above. When the Rossmann fold comprises seven p strands, it is 
preferred that the positioning of the individual strands is 3214567 as defined above. 

10 Methods of treatment the present invention include those wherein the first molecule 

exhibits less than about 90% amino acid sequence identity with the I domain amino 
acid sequence of LFA-1 as set out in FIGURE L Preferably, the first molecule will 
have a percent amino acid sequence identity with the I domain of LFA-1 less than 
about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, 

15 about 75%, about 80%, about 85%, or about 90%. Sequence identity for purposes of 

this aspect of the present invention is calculated using, for example, basic BLAST 
search analysis with default parameters. 

The present invention also provides methods for identifying a 
modulator of binding between a first molecule that is not LFA-1 or an I domain 

20 fi-agment thereof and a binding partner molecule, wherein said first molecule is an 

a/p protein selected fi-om the group of proteins set forth in Table 1, said method 
comprising the steps of measuring binding between the first molecule and the binding 
partner molecule in the presence and absence of a test compound, and identifying the 
test compound as a modulator of binding when a change in binding between the first 

25 molecule and the binding partner molecule is detected in the presence of the test 

compound as compared to binding in the absence of the test compound. In one 
aspect, the present invention provides methods wherein the a/p protein comprises a 
Rossmann fold. In another aspect, methods are provided wherein the Rossmann fold 
in the targeted protein includes five, six or seven p strands which makeup the central 

30 p sheet structure. When the Rossmarm fold comprises five p strands, it is preferred 

that the positioning of the individual strands is 32145 as defined above. When the 
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Rossmann fold comprises six p strands, it is preferred that the positioning of the 
individual strands is 321456 231456 as defined above. When the Rossmann fold 
comprises seven p strands, it is preferred that the positioning of the individual strands 
is 3214567 as defined above. Methods of the present invention include those wherein 
the first molecule exhibits less than about 90% amino acid sequence identity with the 
I domain amino acid sequence of LFA-1 as set out in FIGURE 1. Preferably, the first 
molecule will have a percent amino acid sequence identity with the I domain of LFA- 
1 less than about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, 
about 70%, about 75%, about 80%, about 85%, or about 90%. Sequence identity for 
purposes of this aspect of the present invention is calculated using, for example, basic 
BLAST search analysis with default parameters. 

The present invention also provides modulators of binding between a 
first molecule that is not LFA-1 or an I domain fragment thereof and a binding partner 
molecule, wherein said first molecule is an a/p protein selected firom the group of 
proteins set forth in Table 1. In one aspect, the modulators are those that affect 
binding of an a/p protein which comprises a Rossmann fold. In another aspect, 
modulators are provided which affect binding when the Rossmann fold in the targeted 
protein includes five, six or seven p strands which makeup* the central p sheet 
structure. When the Rossmann fold comprises five p strands, it is preferred that the 
positioning of the individual strands is 32145 as defined above. When the Rossmaim 
fold comprises six p strands, it is preferred that the positioning of the individual 
strands is 321456 or 231456 as defined above. When the Rossmann fold comprises 
seven p strands, it is preferred that the positioning of the individual strands is 3214567 
as defined above. Modulators are also provided for a first molecule which exhibits 
less than about 90% amino acid sequence identity with the I domain amino acid 
sequence of LFA-1 as set out in FIGURE 1 . Preferably, the first molecule will have a 
percent amino acid sequence identity with the I domain of LFA-1 less than about 
40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 
75%, about 80%, about 85%, or about 90%, . Sequence identity for purposes of this 
aspect of the present invention is calculated using, for example, basic BLAST search 
analysis with default parameters. 
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The present invention also provides compositions comprising a 
modulator. Preferred compositions are pharmaceutical compositions. The 
pharmaceutical compositions of the present invention comprise one or more 
modulators of the present invention, preferably further comprising a pharmaceutically 
acceptable carrier or diluent. The term "pharmaceutically acceptable carrier" as used 
herein refers to compounds suitable for use in contact with recipient animals, prefer- 
ably mammals, and more preferably humans, and having a toxicity, irritation, or 
allergic response commensurate with a reasonable benefit/risk ratio, and effective for 
their intended use. 

The present invention also provides modulators which exist in a 
prodrug form. The term "prodrug" as used herein refers to compounds which are 
rapidly transformed in vivo to the parent, or active modulator, compound , for 
example, by hydrolysis. A thorough discussion is provided in Higuchi, et al. 
Prodrugs as Novel Dehv er\^ Systems, vol, 14 of the A.C.S.D. Symposium Series, and 
in Roche (ed), Bioreversible Carriers in Drug Design . American Pharmaceutical 
Association and Pergamon Press, 1987, both of which are incorporated herein by 
reference. Prodrug design is discussed generally in Hardm.a, et al, (Eds), Goodman 
& Oilman's The Pharm acological Basis of Therapeutics . Ninth Edition, New York, 
New York (1996), pp. 1 1-16. Briefly, administration of a drug is followed by 
elimination from the body or some biotransformation whereby biological activity of 
the drug is reduced or eliminated. Alternatively, a biotransformation process may 
lead to a metabolic by-product which is itself more active or equally active as 
compared to the drug initially administered. Increased understanding of these 
biotransformation processes permits the design of so-called "prodrugs" which, 
following a biotransformation, become more physiologically active in an altered state. 
Prodrugs are therefore pharmacologically inactive compounds which are converted to 
biologically active metabolites. In some forms, prodrugs are rendered 
pharmacologically active through hydrolysis of, for example, an ester or amide 
linkage, often times introducing or exposing a functional group on the prodrug. The 
thus modified drug may also react with an endogenous compound to form a water 
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soluble conjugate which further increases pharmacological properties of the 
compound, for example, as a resuh of increased circulatory half-life. 

As another alternative, prodrugs can be designed to undergo covalent 
modification on a functional group with, for example, glucuronic acid sulfate, 
glutathione, amino acids, or acetate. The resulting conjugate may be inactivated and 
excreted in the urine, or rendered more potent than the parent compound. High 
molecular weight conjugates may also be excreted into the bile, subjected to 
enzymatic cleavage, and released back into circulation, thereby effectively increasing 
the biological half-life of the originally administered compound. 

Compounds of the present invention may exist as stereoisomers where 
asymmetric or chiral centers are present. Stereoisomers are designated by either "S" 
or "R" depending on the arrangement of substituents around a chiral carbon atom. 
Mixtures of stereoisomers are contemplated by the present invention. Stereoisomers 
include enantiomers, diastereomers, and mixtures thereof, Lidividual stereoisomers of 
compounds of the present invention can be prepared synthetically from commercially 
available starting materials which contain asymmetric or chiral centers or by 
preparation of racemic mixtures followed by separation or resolution techniques well 
known in the art. Methods of resolution include (1) attachment of a mixture of 
enantiomers to a chiral auxiliary, separation of the resulting mixture by 
recrystallization or chromatography, and liberation of the optically pure product from 
the auxiliary; (2) salt formation employing an optically active resolving agent, and (3) 
direct separation of the mixture of optical enantiomers on chiral chromatographic 
columns. 

The pharmaceutical compositions of the present invention can be 
administered to humans and other animals by any suitable route. For example, the 
compositions can be administered orally, rectally, parenterally, intracistemally, 
intravaginally, intraperitoneally, topically (as by powders, ointments, or drops), 
bucally, or nasally. The term "parenteral" administration as used herein refers to 
modes of administration which include intravenous, intraarterial, intramuscular, 
intraperitoneal, intrastemal, intrathecal, subcutaneous and intraarticular injection and 
infusion. 
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Pharmaceutical compositions of this present invention for parenteral 
injection comprise pharmaceutically-acceptable sterile aqueous or nonaqueous 
solutions, dispersions, suspensions or emulsions as well as sterile powders for 
reconstitution into sterile injectable solutions or dispersions just prior to use. 
5 Examples of suitable aqueous and nonaqueous carriers, diluents, solvents or vehicles 

include water, ethanol, polyols (such as glycerol, propylene glycol, polyethylene 
glycol, and the like), and suitable mixtures thereof, vegetable oils (such as olive oils), 
and injectable organic esters such as ethyl oleate. Proper fluidity can be maintained, 
for example, by the use of coating materials such as lecithin, by the maintenance of 

10 the required particle size, in the case of dispersions, and by the use of surfactants. 

These compositions may also contain adjuvants such as preservatives, 
wetting agents, emulsifying agents, and dispersing agents. Prevention of the action of 
microorganisms may be ensured by the inclusion of various antibacterial and 
antifungal agents, for example, paraben, chlorobutanol, phenol sorbic acid, and the 

15 Hke. It may also be desirable to include isotonic agents such as sugars, sodium 

chloride, and the Hke. Prolonged absorption of the injectable pharmaceutical form 
may be brought about by the inclusion of agents which delay absorption such as 
aluminum monostearate and gelatin. 

In some cases, in order to prolong the effect of the drug, it is desirable 

20 to slow the absorption of the drug from subcutaneous or intramuscular injection. This 

result may be accomplished by the use of a liquid suspension of crystalline or 
amorphous materials with poor water solubility. The rate of absorption of the drug 
then depends upon its rate of dissolution, which in turn may depend upon crystal size 
and crystalline form. Alternatively, delayed absorption of a parenterally administered 

25 drug from is accompUshed by dissolving or suspending the drug in an oil vehicle. 

Injectable depot forms are made by forming microencapsule matrices 
of the drug in biodegradable polymers such a polylactide-polyglycolide. Depending 
upon the ratio of drag to polymer and the nature of the particular polymer employed, 
the rate of drag release can be controlled. Examples of other biodegradable polymers 

30 include poly(orthoesters) and poly(aiihydrides). Depot injectable formulations are 
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also prepared by entrapping the drug in liposomes or microemulsions which are 

compatible with body tissue. 

The injectable formulations can be sterilized, for example, by filtration 

through a bacterial- or viral-retaining filter, or by incorporating sterilizing agents in 

the form of sterile sohd compositions which can be dissolved or dispersed in sterile 

water or other sterile injectable medium just prior to use. 

Solid dosage forms for oral administration include capsules, tablets, 

pills, powders, and granules. Jn such solid dosage forms, the active compound is 

mixed with a least one inert, pharmaceutically-acceptable excipient or carrier such as 
sodium citrate or dicalcium phosphate and/or (a) fillers or extenders such as starches, 
lactose, sucrose, glucose, mannitol, and sihcic acid, (b) binders such as, for example, 
carboxymethylcellulose, gums {e g. alginates, acacia) gelatin, polyvinylpyrroHdone, 
and sucrose, (c) humectants such as glycerol, (d) disintegrating agents such as agar- 
agar, calcium carbonate, potato or tapioca starch, alginic acid, certain silicates, and 
sodium carbonate, (e) solution retarding agents such a paraffin, (f) absorption 
accelerators such as quaternary ammonium compoxmds, (g) wetting agents such as, for 
example, cetyl alcohol and glycerol monostearate, (h) absorbents such as kaolin and 
bentonite clay, and (i) lubricants such as talc, calcium stearate, magnesium stearate, 
solid polyethylene glycols, sodium lauryl sulfate, and mixtures thereof In the case of 
capsules, tablets and pills, the dosage form may also comprise buffering agents. 

Solid compositions of a similar type may also be employed as fillers in 
soft and hard-filled gelatin capsules using such excipients as lactose or milk sugar as 
well as high molecular weight polyethylene glycols and the like. 

The sohd dosage forms of tablets, dragees, capsules, pills, and granules 
can be prepared with coatings and shells such as enteric coatings and other coatings 
well known in the pharmaceutical formulating art. They may optionally contain 
opacifying agents and can also be of a composition that they release the active 
ingredient(s) only, or preferentially, in a part of the intestinal tract, optionally, in a 

delayed manner. Exemplary materials include polymers having pH sensitive 

® 

solubility, including commercially available materials such as Eudragit . Examples 
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of embedding compositions which can be used include polymeric substances and 
waxes. 

The active compounds can also be in micro-encapsulated form if 
appropriate, with one or more of the above-mentioned excipients. 
5 Liquid dosage forms for oral administration include pharmaceutically- 

acceptable emulsions, solutions, suspensions, syrups and elixirs. Li addition to the 
active compounds, the liquid dosage forms may contain inert diluents commonly used 
in the art such as, for example, water or other solvents, solubilizing agents and 
emulsifiers such as ethyl alcohol, isopropyl alcohol, ethyl carbonate, ethyl acetate, 
10 benzyl alcohol, benzyl benzoate, propylene glycol, 1,3-butylene glycol, dimethyl 

formamide, oils (in particular, cottonseed, groundnut, com, germ, olive, castor, and 
sesame oils), glycerol, tetrahydrofiirfuryl alcohol, polyethylene glycols and fatty acid 
esters of sorbitan, and mixtures thereof. 

Besides inert diluents, the oral compositions can also include adjuvants 
1 5 such as wetting agents, emulsifying and suspending agents, sweetening, flavoring, and 

perfuming agents. 

Suspensions, in addition to the active compounds, may contain 
suspending agents such as, for example, ethoxylated isostearyl alcohols, 
polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, alrmiinum 
20 metahydroxide, bentonite, agar-agar, and tragacanth, and mixtures thereof. 

Compositions for rectal or vaginal administration are preferably 
suppositories which can be prepared by mixing the compounds of the present 
invention with suitable non-irritating excipients or carriers such as cocoa butter, 
polyethylene glycol or suppository wax, which are solid at room temperature but 
25 liquid at body temperature. Accordingly, such carriers melt in the rectum or vaginal 

cavity, releasing the active compound. 

Compounds of the present invention can also be administered in the 
form of liposomes. As is known in the art, liposomes are generally derived from 
phospholipids or other lipid substances. Liposomes are formed by mono- or multi- 
30 lamellar hydrated liquid crystals that are dispersed in an aqueous medium. Any non- 

toxic, physiologically-acceptable and metabolizable lipid capable of forming 
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liposomes can be used. The present compositions in liposome form can contain, in 
addition to a compound of the present invention, stabilizers, preservatives, excipients, 
and the like. The preferred lipids are the phospholipids and the phosphatidyl cholines 
(lecithins), both natural and synthetic. Methods to form liposomes are known in the 

5 art. See, for example, Prescott, Ed., Methods in Cell Biolo gy, Volume XIV, 

Academic Press, New York, N.Y. (1976), p. 33 et seq. 

The compounds of the present invention may be used in the form of 
pharmaceutically-acceptable salts derived from inorganic or organic acids. 
"Pharmaceutically-acceptable salts" include those salts which are, within the scope of 

10 sound medical judgment, suitable for use in contact with the tissues of humans and 

lower animals without vmdue toxicity, irritation, allergic response and the like, and are 
commensurate with a reasonable benefit/risk ratio. Pharmaceutically-acceptable salts 
are well known in the art. For example, S. M, Berge, et ai, describe 
pharmaceutically-acceptable salts in detail in J. Pharmaceutical Sciences, 66: 1 

15 (1977), incorporated herein by reference in its entirety. The salts may be prepared in 

situ during the final isolation and purification of the compounds of the present 
invention or separately by reacting a free base function with a suitable acid. 
Representative acid addition salts include, but are not limited to acetate, adipate, 
alginate, citrate, aspartate, benzoate, benzenesulfonate, bisulfate, butyrate, 

20 camphorate, camphorolsulfonate, digluconate, glycerophosphate, hemisulfate, 

heptanoate, hexanoate, fumarate hydrochloride, hydrobromide, hydroiodide, 2- 
hydroxyethanesulfonate (isothionate), lactate, maleate, methanesulfonate, nicotinate, 
2-naphthalenesulfonate, oxalate, pabnoate, pectinate, persulfate, 3-phenylpropionate, 
picrate, pivalate, propionate, succinate, tartrate, thiocyanate, phosphate, glutamate, 

25 bicarbonate, p-toluenesulfonate and undecanoate. Examples of acids which may be 

employed to form pharmaceutically acceptable acid addition salts include inorganic 
acids as hydrochloric acid, hydrobromic acid, sulphuric acid and phosphoric acid and 
such organic acids as oxalic acid, maleic acid, succinic acid and citric acid. 

Basic nitrogen-containing groups can be quatemized with agents such 

30 as, for example, lower alkyl halides including methyl, ethyl, propyl, and butyl 

chlorides, bromides and iodides; dialkyl sulfates like dimethyl, diethyl, dibutyl and 



28 



diamyl sulfates; long chain halides such as decyl, lauryl, myristyl and stearyl 
chlorides, bromides and iodides; arylalkyl halides like benzyl and phenethyl bromides 
and others. Water or oil-soluble or dispersible products are thereby obtained. 

Basic addition salts can be prepared in situ during the final isolation 
5 and purification of compounds of the present invention by reacting a carboxylic acid- 

containing moiety with a suitable base such as the hydroxide, carbonate or bicarbonate 
of a pharmaceutically acceptable metal cation or with ammonia or with an organic 
primary, secondary or tertiary amine. Pharmaceutically-acceptable basic addition salts 
include, but are not hmited to, cations based on alkali metals or alkaline earth metals 

10 such as lithium, sodium, potassium, calcium, magnesium and aluminum salts and the 

like and nontoxic quaternary ammonia and amine cations including ammonium, 
tetramethylammonium, tetraethylammonium, methylamine, dimethylamine, 
trimethylamine, triethylamine, diethylamine, ethylamine and the like. Other 
representative organic amines useful for the formation of base addition salts include 

15 ethylenediamine, ethanolamine, diethanolamine, piperidine, piperazine and the like. 

Dosage forms for topical administration of a compound of the present 
invention include powders, sprays, ointments and inhalants. The active compound is 
mixed under sterile conditions with a pharmaceutically-acceptable carrier and any 
needed preservatives, buffers, or propellants which may be required. Ophthalmic 

20 formulations, eye ointments, powders, and solutions are also contemplated as being 

within the scope of the present invention. 

Actual dosage levels of active ingredients in the pharmaceutical 
compositions of this present invention may be varied so as to obtain an amount of the 
active compound(s) that is effective to achieve the desired therapeutic response for a 

25 particular patient, compositions, and mode of administration. The selected dosage 

level will depend upon the activity of the particular compound, the route of 
administration, the severity of the condition being treated, and the condition and prior 
medical history of the patient being treated. However, it is within the skill of the art 
to start doses of the compound at levels lower than required to achieve the desired 

30 therapeutic effort and to gradually increase the dosage until the desired effect is 

achieved. 
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Generally dosage levels of about 0.1 to about 1000 mg, about 0.5 to 
about 500 mg, about 1 to about 250 mg, about 1 .5 to about lOOmg, and preferably of 
about 5 to about 20 mg of active compound per kilogram of body weight per day are 
administered orally or intravenously to a mammalian patient. If desired, the effective 
daily dose maybe divided into multiple doses for purposes of administration, e.g., two 
to four separate doses per day. 

The efficacy of the compounds of the present invention have been 
investigated and can be described by parameters, such as, for example EC50 and 
LC50. As used herein, the term EC50 refers to the effective concentration needed to 
inhibit activity by 50% in a cell based assay. The term IC50, as used herein, refers to 
the concentration required to inhibit protein activity in a biochemical assay by 50%. 
The term LD50, as used herein, refers to the compound concentration necessary to kill 
50% of the cells over a defined time interval in toxicity assays. 

TABLE 1 

Proteins which Comprise I or A domains, 
G proteins, heterotrimeric G proteins, and tubulin GTPase. 

1 . TIM beta/alpha-barrel (23) 

contains parallel beta-sheet barrel, closed; n=^8, S=8; strand order 12345678 
the first six superfizmilies have similar phosphate-binding sites 

1 . Triosephosphate isomerase (TIM) (1) 

1 . Triosephosphate isomerase (TIM) (12) 

2. Ribulose-phoshate binding barrel (4) 

1 . Histidine biosynthesis eazymes (2) 

structural evidence for the gene duplication within the barrel 
fold 

2. D-ribulose-5-phosphate 3-epimerase (1) 

3. Orotidine 5 '-monophosphate decarboxylase (OMP 
decarboxylase) (4) 

4. Tryptophan biosynthesis enzymes (6) 

3. Thiamin phosphate synthase (1) 
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TABLE 1 (continued) 

1 . Thiamin phosphate synthase ( 1 ) 

FMN-linked oxidoreductases (1) 

1 . FMN-Hnked oxidoreductases (9) 

Inosine monophosphate dehydrogenase (IMPDH) (1) 

The phosphape moiety of substrate binds in the 'common ' 

phosphate-binding site 

1 . Inosine monophosphate dehydrogenase (IMPDH) (4) 
PLP-binding barrel (2) 

circular permutation of the canonical fold: begins with an alpha helix 
and ends with a beta-strand 

1 . Alanine racemase-hke, N-terminal domain (4) 

2. "Hypothetical" protein ybl036c (1) 
NAD(P)-linked oxidoreductase (1) 

1 . Aldo-keto reductases (NADP) (7) 

Common fold covers whole protein structure 
(Trans)glycosidases (7) 

1. alpha- Amylases, N-terminal domain (22) 

Common fold domain is interrupted by a small calcium-binding 
subdomain 

This domain is followed by an all-beta domain common to the 
family 

2. beta- Amylase (4) 

3. beta-glycanases (21) 

consist of a number of sequence families 

4. Family 1 of glycosyl hydrolase (8) 

5. Type 11 chitinase (9) 
glycosylase family 18 

6. Bacterial chitobiase (beta-N-acetylhexosaminidase), catalytic 
domain (1) 

Glycosyl hydrolase family 20 
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TABLE 1 (continued) 

7. Beta-D-glucan exohydrolase, N-terminal domain (1) 
Metallo-dependent hydrolases (3) 

the beta-sheet barrel is similarly distorted and capped by a C-terminal 
helix has transition metal ions bound inside the barrel 

1 . Adenosine deaminase (ADA) ( 1 ) 

2. alpha-subimit of urease, catalytic domain (2) 

3. Phosphotriesterase-like (2) 
Aldolase (4) 

Common fold covers whole protein structure 

1. Class I aldolase (14) 

the catalytic lysine forms schiff-base intermediate with 
substrate 

2. Class n aldolase (1) 
metal-dependent 

3. 5-aminolaevulinate dehydratase, ALAD (porphobilinogen 
synthase) (3) 

hybrid of classes I and II aldolase 

4. Class I DAHP synthetase (2) 
Enolase C-terminal domain-like (2) 

binds metal ion (magnesium or manganese) in conserved site inside 
barrel 

N-terminal alpha+beta domain is common to this family 

1 . Enolase (2) 

2. D-glucarate dehydratase- hke (6) 
Phosphoenolpyruvate/pymvate domain (6) 

1 . Pyruvate kinase (5) 

2. Pyruvate phosphate dikinase, C-terminal domain (1) 

3 . Phosphoenolpyruvate carboxylase ( 1 ) 

4. Phosphoenolpyruvate mutase ( 1 ) 
forms a swapped dimer 
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TABLE 1 (continued) 

5 . 2-dehydro-3-deoxy-galactarate aldolase ( 1 ) 

forms a swapped dimer; contains a PK-type metal-binding site 

6. Isocitrate lyase (2) 

forms a swapped dimer; elaborated with additional 
5 subdomains 

1 3 . Malate synthase G ( 1 ) 

1 . Malate synthase G ( 1 ) 

14. RuBisCo, C-terminal domain (1) 

1 . RuBisCo, large subunit, C-terminal domain (6) 
1 0 N-terminal domain is alpha-^beta 

1 5 . Xylose isomerase-like (3) 

different families share similar but non-identical metal-binding sites 

1 . Endonuclease IV ( 1 ) 

2 . L-rhanmose isomerase ( 1 ) 
15 3 . Xylose isomerase (12) 

16. Bacterial luciferase-like (3) 

consists of clearly related families of somewhat different folds 
1 . Bacterial luciferase (alkanal monooxygenase) (1 ) 
typical (beta/alpha) 8-barr el fold 
20 2. Non-fluorescent flavoprotein (luxF, FP390) (2) 

incomplete beta/alpha barrel with mixed beta-sheet of 7 
strands 

3. Coenzyme F420 dependent tetrahydromethanopterin reductase 
(1) 

25 1 7. Quinolinic acid phosphoribosyltransferase, C-terminal domain (1) 

incomplete beta/alpha barrel with parallel beta-sheet of 7 strands 
1 . Quinolinic acid phosphoribosyltransferase, C-terminal domain 
(2) 

18. Phosphatidylinositol-specific phospholipase C (PI-PLC) (2) 
30 1 . Mammalian PLC (1) 
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TABLE 1 (continued) 

2. Bacterial PLC (2) 

1 9. Cobalamin (vitamin B 1 2)-dependent enzymes (3) 

1 . Methylmalonyl-CoA mutase, N-terminal (CoA-binding) 
domain (1) 

2. Glutamate mutase, large subunit (1) 

3. Diol dehydratase, alpha subunit (1) 

20. tRNA-guanine transglycosylase (1) 

1 . tRNA-guanine transglycosylase ( 1 ) 

2 1 . Dihydropteroate synthetase-like (2) 

1 . Dihydropteroate synthetase (3) 

2 . Methylt etrahydrofolate : corrinoid/iron-sulfur protein 
methyltransferase MetR (1) 

22. Uroporphyrinogen decarboxylase, UROD (1) 

1 . Uroporphyrinogen decarboxylase, UROD ( 1 ) 

23. Methylenetetrahydro folate reductase (1) 

1 . Methylenetetrahydrofolate reductase ( 1 ) 
NAD(P)-binding Rossmann-fold domains (1) 

core: 3 layers, a/b/a; parallel beta-sheet of 6 strands, order 321456 

The nucleotide-binding modes of this and the next two folds/superfamilies are 

similar 

1 . NAD(P)-binding Rossmann-fold domains (8) 

1 . Alcohol/glucose dehydrogenases, C-terminal domain (9) 
N-tetminal all-beta domain defines family 

2. T>Tosine-dependent oxidoreductases (27) 

also known as short-chain dehydrogenases and SDR family 
parallel beta-sheet is extended by 7th strand, order 3214567; 
left-handed 

crossover connection between strands 6 and 7 

3. Glyceraldehyde-3-phosphate dehydrogenase- like, N-terminal 
domain (20) 
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TABLE 1 (continued) 



family members also share a common alpha+beta fold in 



C-terminal domain 



4. 



6. 



5. 



Formate/glycerate dehydrogenases, NAD-domain (9) 

this domain interrupts the other domain which defines family 

Lactate & malate dehydrogenases, N-terminal domain (16) 

6-phosphogluconate dehydrogenase-hke, N-terminal domain 

(8) 



the beta-sheet is extended to 8 strands, order 32145678; 



7. 



strands 7 & 8 are antiparallel to the rest 
C-terminal domains also show some similarity 
Amino-acid dehydrogenase-like, C-terminal domain (11) 



8. Succinyl-CoA synthetase, alpha-chain, N-terminal 
(CoA-binding) domain (2) 
FAD/NAD(P)-binding domain (1) 

core: 3 layers, b/b/a; central parallel beta-sheet of 5 strands, order 32145; 
top antiparallel beta-sheet of 3 strands, meander 
1 . FAD/NAD(P)-binding domain (5) 

L C-terminal domain of adrenodoxin reductase-like (3) 

2 . F AD-linked reductases, N-terminal domain (10) 
C-terminal domain is alpha+beta is common for the family 

3. Guanine nucleotide dissociation inhibitor, GDI (1) 



Similar to F AD-linked reductases in both domains but does not 



bind FAD 



4. 



5. 



Succinate dehydrogenase/fiimarate reductase N-terminal 
domain (5) 

FAD/NAD-linked reductases, N-terminal and central domains 
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TABLE 1 (continued) 

Nucleotide-binding domain (1) 

3 layers: a/b/a; parallel beta-sheet of 5 strands, order 32145; Rossmann-like 

1 . Nucleotide-binding domain (2) 

this superfamily shares the common nucleotide-binding site with and 
provides a link between the Rossmann-fold NAD(P)-binding and 
FAD/NAD(P)-binding domains 

1 . N-terminal domain of adienodoxin reductase-like (3) 

2. D-amino acid oxidase, N-terminal domain (2) 

This family is probably related to the FAD-linked reductases 
and shares with them the C-terminal domain fold 

N-temiinal domain of MurD (UDP-N-acetylmuramoyl-L-alanine:D-glutamate 

ligase) (1) 

3 layers: a/b/a; parallel beta-sheet of 5 strands, order 32145; incomplete 

Rossmann-like fold; binds UDP group 

1 . N-temiinal domain of MurD (UDP-N- 

acetylmuramoyl-L-alanine:D-glutamate ligase) (1) 

1 . N-terminal domain of MurD 

(UDP-N-acetylmuramoyl-L-alanine:D"glutamate ligase) (1) 

Cellulases (1) 

variant of beta/alpha barrel; parallel beta-sheet barrel closed, n'=7, S=^8; 
strand order 1234567 
1. Cellulases (1) 

1. Cellulases (4) 
PFL-like glycyl radical enzymes (1) 

contains: barrel closed; n=10, S=10; accommodates a hairpin loop inside the 
barrel 

1 . PFL-like glycyl radical enzymes (3) 

duplication: the - and C~terminal halves have similar topologies 

1 . Pyruvate formate-lyase, PFL ( 1 ) 

2. Rl subunit of ribonucleotide reductase, C-terminal domain (1) 
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TABLE 1 (continued) 

3. Class III anaerobic ribonucleotide triphosphate reductase 
NRDD subunit (1) 
The "swivelling" beta/beta/alpha domain (5) 

3 layers: h/b/a; the central sheet is parallel and the other one is antiparallel; 
there are some 
variations in topology 

this domain is thought to be mobile in all proteins known to contain it 

1 . Phosphohistidine domain (2) 
contains barrel, closed, n=7, S=10 

1 . Pyruvate phosphate dikinase, central domain (1) 

2. N-terminal domain of enzyme I of the PEPisugar 
phosphotransferase system (1) 

2. Aconitase, C-terminal domain (1) 

contains mixed beta-sheet barrel, closed n^-7, S=10 
1 . Aconitase, C-terminal domain (2) 

3. Carbamoyl phosphate synthetase, small subunit N-terminal domam (1) 
1 . Carbamoyl phosphate synthetase, small subunit N-terminal 

domain (1) 

4. Transferrin receptor ectodomain, apical domain (1) 

1 . Transferrin receptor ectodomain, apical domain (1) 

5. GroEL-like chaperone, apical domain (2) 

1. GroEL(2) 

2. Group n chaperonin (CCT, TRIC) (1) 
Barstar-like (2) 

2 layers, a/b; parallel beta-sheet of 3 strands, order 123 

1 . B arstar (bamase inhibitor) ( 1 ) 

1 . Barstar (bamase inhibitor) (1 ) 

2. Ribosomal protein L32e (1) 

1 , Ribosomal protein L3 2e ( 1 ) 

contains irregular N-terminal extension to the common fold 
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TABLE 1 (continued) 

Leucine-rich repeat, LRR (right-handed beta- alpha superhehx) (2) 
2 curved layers, a/b; parallel beta-sheet; order 1234...N 

1. RNI-like(3) 

regular structure consisting of similar repeats 

1 . Ribonuclease inhibitor (2) 

2. Rnalp(l) 

3. CycHn A/CDK2-associated pl9, Skp2 (1) 

2. L domain-Uke (5) 

less regular structure consisting of variable repeats 

1 . Intemahn B LRR domain ( 1 ) 

2. Rab geranylgeranyltransferase alpha-subunit, C-terminal 
domain (1) 

3 . mRN A export factor tap ( 1 ) 

4. U2A'-like(l) 

duplication: consists of 5-6 partly irregular repeats 

5. LI and L2 domains of the type 1 insuhn-Uke growth factor 
receptor (1) 

Outer arm dynein Ught chain 1(1) 

(beta-beta-alpha)n superhelix 

1 . Outer arm dynein light chain 1 (1) 

1 . Outer arm dynein light chain 1 (1) 
Ribosomal proteins L15p and L18e (1) 
core: three turns of irregular (beta-beta-alpha)n superhelix 
1 . Ribosomal proteins LI 5p and LI 8e (1) 

1 . Ribosomal proteins LI 5p and LI 8e (2) 
SpoHaa-like (2) 

core: 4 turns of a (beta-alphajn superhelix 

1 . C-terminal domain of phosphatidylinositol transfer protein secl4p (1) 
1 . C-terminal domain of phosphatidylinositol transfer protein 
secl4p (1) 
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TABLE 1 (continued) 

2. Spollaa(l) 

L SpoIIaa(l) 
14. ClpP/crotonase(l) 

core: 4 turns of (beta-beta'alpha)n superhelix 
5 L ClpP/crotonase (3) 

1 , Clp protease, ClpP subunit ( 1 ) 

2, Photo system II Dl C-terminal processing protease, catalytic 
domain (1) 

3, Crotonase-like (4) 
10 15, BRCT domain (1) 

3 layers, a/b/a; core: parallel beta-sheet of 4 strands y order 2134 
1, BRCT domain (2) 

1 . DNA-repair protein XRCCl (1) 

2. NAD+-dependent DNA ligase, domain 4(1) 

15 16. beta-subunit of the lumazine synthase/riboflavin synthase complex (1) 

3 layers, a/b/a; core: parallel beta-sheet of 4 strands, order 2134 
1 . beta-subunit of the lumazine synthase/riboflavin synthase complex ( 1 ) 
1 . beta-subunit of the lumazme synthase/riboflavin synthase 
complex (4) 
20 17. Caspase4ike(l) 

3 layers, a/b/a; core: parallel beta-sheet of 4 strands, order 2134 
1 . Caspase-like (2) 

heterodimeric protein folded in a single domain 
1 . Caspase (3) 

25 2. Gingipain R (RgpB), N-terminal domain (1) 

1 8. DNA glycosylase ( 1 ) 

3 layers, a/b/a; core: parallel beta-sheet of 4 strands, order 2134 
1 . DNA glycosylase (2) 

1 . Uracil-DNA glycosylase (3) 
30 2. G:T/U mismatch-specific DNA glycosylase (1) 



39 



TABLE 1 (continued) 

Catalytic domain of malonyl-CoA ACP transacylase (1) 

3 layers, a/b/a; core: parallel beta-sheet of 4 strands, order 2134 

1 . Catalytic domain of malonyl-CoA ACP transacylase (1) 

1. Catalytic domain of malonyl-CoA ACP transacylase (1) 
Initiation factor IF2/eIF5b, domain 3 (1) 

3 layers, a/b/a; core: parallel beta-sheet of 4 strands, order 2134 
1. Initiation factor IF2/eIF5b, domain 3(1) 

1 . Initiation factor IF2/eIF5b, domain 3(1) 
Ribosomal protein LI 3 (1) 

3 layers, a/b/a; core: parallel beta-sheet of 4 strands, order 3214 
1 . Ribosomal protein L 1 3 ( 1 ) 

1 . Ribosomal protein L 1 3 ( 1 ) 
Ribosomal protein L4 ( 1 ) 

3 layers, a/b/a; core: parallel beta-sheet of 4 strands, order 1423 

1 . Ribosomal protein L4 ( 1 ) 

1 . Ribosomal protein L4 (2) 
Flavodoxin-like (16) 

3 layers, a/b/a; parallel beta-sheet of 5 strand, order 21345 
L CheY-like(3) 

1. CheY-related(ll) 

2. Receiver domain of the ethylene receptor (1) 

3. Negative regulator of the amidase operon AmiR (1) 

2. Toll/titerleukin receptor TIR domain (1) 

1 . ToU/Interleukin receptor TIR domain (2) 

3. Hypothetical protein MTH538 (1) 

1 , Hypothetical protein MTH538 (1) 

4. Succinyl-CoA synthetase domains (1) 

1 , Succinyl-CoA synthetase domains (4) 

contain additional N-terminal strand "0", antiparallel to strand 
2 
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TABLE 1 (continued) 

Flavoproteins (3) 

1 . Flavodoxin-related (8) 
binds FMN 

2. NADPH-cytochrome p450 reductase, N-terminal domain (2) 

3 . Quinone reductase (4) 
binds FAD 

Cobalamin (vitamin B12)-binding domain (1) 

1 . Cobalamin (vitamin B 1 2)-binding domain (4) 

Ornithine decarboxylase N-terminal "wing" domain (1) 

1 . Ornithine decarboxylase N-terminal "wing" domain (1) 

N5-carboxyaminoimidazole ribonucleotide (N5-CAIR) mutase PurE 

(1) 

1 . N5-carboxyaminoimidazole ribonucleotide (N5-CAIR) mutase 

PurE (1) 
Cutinase-like (1) 
1 . Cutinase-like (3) 

this family can be also classified into alpha/beta hydrolase 

superfamily 
Esterase/acetylhydrolase (4) 

1. Esterase (1) 

2. Esterase domain of haemagglutinin-esterase- fusion 
glycoprotein HEFl (1) 

3 . Acetylhydrolase ( 1 ) 

4. Rhanmogalacturonan acetylesterase (1) 
Beta-D-glucan exohydrolase, C-terminal domain (1) 

1 . Beta-D-glucan exohydrolase, C-terminal domain (1) 
Formate/glycerate dehydrogenase catalytic domain-like (3) 
1 , Formate/glycerate dehydrogenases, substrate-binding domain 
(6) 

this domain is interrupted by the Rossmann-fold domain 
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TABLE 1 (continued) 

2. L-alanine dehydrogenase (1) 

3 . S-adenosylhomocystein hydrolase (2) 

1 3 . Type II 3 -dehydroquinate dehydratase ( 1 ) 

1 . Type II 3-dehydroqumate dehydratase (2) 

1 4. Nucleoside 2-deoxyribosyltransferase (1 ) 

1 . Nucleoside 2-deoxyribosyltransferase ( 1 ) 

1 5 . Ribosomal protein S2 ( 1 ) 

fold elaborated with additional structures 
1 . Ribosomal protein S2 ( 1) 

1 6. Class I glut amine amidotransferase-like (4) 

conserved positions of the oxyanion hole and catalytic nucleophile; 
different constituent families contain different additional structures 

1 . Class I glutamine amidotransferases (GAT) (3) 
contains a catalytic Cys-His-Glu triad 

2. Intracellular protease (1) 

contains a catalytic Cys-His-Glu triad that dijfersfrom the 
class I OA T triad 

3 . Catal ase, C-terminal domain ( 1 ) 

4. Aspartyl dipeptidase PepE (1) 

probable circular permutation in the common core; contains a 
catalytic Ser-His-Glu triad 
Methylglyoxal synthase-like (1) 

3 layers, a/b/a; parallel beta-sheet of 5 strands, order 32145 
1 . Methylglyoxal synthase-like (2) 

contains a common phosphate-binding site 

1 . Carbamoyl phosphate synthetase, large subunit allosteric, 
C-terminal domain (1) 

2. Methylglyoxal synthase, MgsA (1) 
Ferredoxin reductase-like, C-terminal NADP-linked domain (1) 
3 layers, a/b/a; parallel beta-sheet of 5 strands, order 32145 
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TABLE 1 (continued) 

1 . Ferredoxin reductase-like, C-terminal NADP-linked domain (5) 
binds NADP differently than classical Rossmann-fold 
N-terminal FAD-linked domain contains (6 JO) barrel 
L Reductases (10) 
2, Phthalate dioxygenase reductase (1) 

contains additional 2Fe-2S ferredoxin domain 
3 Dihydroorotate dehydrogenase B, PyrK subunit (1) 

contains 2Fe-2S cluster in the C-terminal extension 

4. NADPH-cytochrome p450 reductase-like (2) 

5. Flavohemoglobin, C-terminal domain (1) 
contains additional globin domain 

Adenine nucleotide alpha hydrolase-like (3) 

core: 3 layers, a/b/a ; parallel beta-sheet of 5 strands, order 32145 

1 . Nucleotidylyl transferase (3) 

1 . Class I aminoacyl-tRNA synthetases (RS), catalytic domain 
(10) 

contains a conserved all-alpha subdomain at the C-terminal 
extension 

2 . Cytidylyltransferase ( 1 ) 

3. Adenylyltransferase (2) 

2. Adenine nucleotide alpha hydrolases (2) 

1 . N-type ATP pyrophosphatases (3) 

2. Phosphoadenylyl sulphate (PAPS) reductase (1) 

3. UDP-glucose dehydrogenase (UDPGDH), C-terminal (UDP-binding) 
domain (I) 

1 , UDP-glucose dehydrogenase (UDPGDH), C-terminal 
(UDP-binding) domain (1) 
Pyrimidine nucleoside phosphorylase central domain (1) 
3 layers: a/b/a; parallel beta-sheet of 5 strands, order 32145; Rossmann-like 
1 . Pyrimidine nucleoside phosphorylase central domain (1) 
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TABLE 1 (continued) 

1 . Pyrimidine nucleoside phosphorylase central domain (2) 
N-terminal domain of DNA photolyase (1) 

3 layers: a/b/a; parallel beta-sheet of 5 strands, order 32145; Rossmann-like 
1 . N-terminal domain of DNA photolyase (1) 

1 . N-terminal domain of DN A photolyase (2) 
ETFP adenine nucleoti de-binding domain-like (1) 
3 layers: a/b/a, core: parallel beta-sheet of 5 strands, order 32145 
1 . ETFP adenine nucleotide-binding domain-like (2) 

1. Electron transfer flavoprotein, ETFP (2) 

contains additional strands on both edges of the core sheet 

1. '^Hypothetical" protein MJ0577 (1) 
Biotin carboxylase N-terminal domain-like (1) 
3 layers: a/b/a; parallel or mixed beta-sheet of 4 to 6 strands 
possible rudiment form of Rossmannfbld domain 
1 . Biotin carboxylase N-terminal domain-like (5) 

superfamily defined by the common ATP-binding domain that follows 

this one 

1 . Biotin carboxylase/Carbamoyl phosphate synthetase (5) 

2. D- Alanine ligase N-terminal domain (2) 

3. Prokaryotic glutathione synthetase, N-terminal domain (1) 

4. Eukaryotic glutathione synthetase (1) 
circularly permuted version of prokaryotic enzyme 

5 . Synapsin la domain ( 1 ) 
DHS-like NAD/FAi:)-binding domain (1 ) 

3 layers: a/b/a; parallel beta-sheet of 6 strands, order 321456; Rossmann-like 
1 . DHS-like NAD/FAD-binding domain (4) 

binds cofactpr molecules in the opposite direction than classical 

Rossman fold 

1 . Deoxyhypusine synthase, DHS (1) 



44 



TABLE 1 (continued) 

2. C-terminal domain of the electron transfer flavoprotein alpha 
subunit (2) 

lacks strand 3; shares the FAD-binding mode with the pyruvate 
oxidase domain 

3- Pyruvate oxidase and decarboxylase, middle domain (5) 

N-terminal domain is Pyr module, and C-terminal domain is 
PP module of thiamin diphosphate-binding fold 

4. Transhydrogenase domain III (dlll) (3) 

binds NADP, shares with the pyruvate oxidase FAD-binding 
domain a common ADP-binding mode 
Tubulin, GTPase domain (1) 

3 layers: a/b/a; parallel beta-sheet of 6 strands, order 321456 
1 . Tubulin, GTPase domain (1) 

1 . Tubulin, GTPase domain, (3) 
Cysteine hydrolase (1) 

3 layers: a/b/a; parallel beta-sheet of 6 strands, order 321456 
1 . Cysteine hydrolase (2) 

1 , N-carbamoylsarcosine amidohydrolase ( 1 ) 

2. YcaC (1) 
Halotolerance protein Hal3 (1) 

3 layers: a/b/a; parallel beta-sheet of 6 strands, order 321456 
1 . Halotolerance protein Hal3 ( 1 ) 

1 , Halotolerance protein Hal3 ( 1 ) 
Glucosamine 6"phosphate deaminase/isomerase (1) 
3 layers: a/b/a; parallel beta-sheet of 6 strands, order 324561 
1 . Glucosamine 6-phosphate deaminase/isomerase (1) 

1 . Glucosamine 6-phosphate deaminase/isomerase (2) 
Thiamin diphosphate-binding fold (THDP-binding) (1) 
3 layers: a/b/a; parallel beta-sheet of 6 strands, order 213465 
1 . Thiamin diphosphate-binding fold (THDP-binding) (4) 
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TABLE 1 (continued) 

both pyridine (Pyr)- and pyrophosphate (PP)-binding modules have 
this fold 

conserved core consists of two Pyr and two PP-modules and binds two 
coenzyme molecules 

1 . Pyruvate oxidase and decarboxylase (5) 

Pyr module is N~terminal domain, PP module is C~terminal 
domain 

Rossmann-like domain is between them 

2 . Transketolase, TK ( 1 ) 

3. Branched-chain alpha-keto acid dehydrogenase (2) 
parent family to TK and PFOR 

heterodimeric protein related to TK; alpha-subunit is the PP 
module and the N-terminal domain of beta-subunit is the Pyr 
module 

4. Pyruvate-ferredoxin oxidoreductase, PFOR, domains I and VI 
(1) 

domains VI, I and II are arranged in the same way as the TKN, 

M and C domains 
P-loop containing nucleotide triphosphate hydrolases (1) 
3 layers: a/b/a, parallel or mixed beta-sheets of variable sizes 
1 . P-loop containing nucleotide triphosphate hydrolases (14) 
division into families based on beta-sheet topologies 

1 . Nucleotide and nucleoside kinases ( 1 6) 
parallel beta-sheet of 5 strands, order 23145 

2. Shikimate kinase (1) 

similar to the nucleotide/nucleoside kinases but acts on 
different substrate 

3 . Chloramphenicol phosphotransferase ( 1 ) 

similar to the nucleotide/nucleoside kinases but acts on 
different substrate 
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TABLE 1 (continued) 

4. Adenosine-5'phosphosulfate kinase (APS kinase) (1) 

5 . PAPS sulfotransferase (4) 

similar to the nucleotide/nucleoside kinases but transfer 
sulphate group 

6. Phosphoribulokinase/pantothenate kinase (2) 

7. 6-phosphofructo-2-kinase/fiTictose-2,6-bisphosphatase, kinase 
domain (1) 

8. G proteins (28) 

core: mixed beta-sheet of 6 strands, order 231456; strand 2 is 
antiparallel to the rest 

9. Motor proteins (7) 

1 0 . Nitrogenase iron protein-like (10) 

core: parallel beta-sheet of 7 strands; order 3241567 

1 1 . Rec A protein-like (ATPase-domain) (9) 

core: mixed beta-sheet of 8 strands, order 32451678; strand 7 
is antiparallel to the rest 

1 2 . ABC transporter ATPase domain-like (7) 

there are two additional subdomains inserted into the central 
core that has a RecA-like topology 

13. Extended AAA-ATPase domain (13) 

fold is similar to that of RecA, but lacks the last two strands, 
followed by a family-specific all-alpha Arg-finger domain 

14. RNAhelicase(l) 

duplication: consists of two similar domains, one binds NTP 
and the other binds RNA; also contains an all-alpha 
subdomain in the C-terminal extension 
Fructose permease, subunit lib (1) 

3 layers: a/b/a, parallel beta-sheet of 6 strands, order 324156 
1 . Fructose permease, subunit lib (1) 

1 . Fructose permease, subunit lib (1) 
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TABLE 1 (continued) 

39. Nicotinate mononucIeotide:5,6-dimethylbenzimidazole 
phosphoribosyltransferase (CobT) (1) 

3 layers: a/b/a, parallel beta-sheet of 7 strands, order 3214567 
1 . Nicotinate mononucleotide: 5, 6-dimethylbenzimida2ole 
5 phosphoribosyltransferase (CobT) ( 1 ) 

1 . Nicotinate mononucleotide : 5 ,6-dimethylbenzimidazole 
phosphoribosyltransferase (CobT) (1) 

40. Methylesterase CheB, C-terminal domain (1) 

3 layers: a/b/a, parallel beta-sheet of 7 strands, order 3421567 
10 1 . Methylesterase CheB, C-terminal domain (1) 

1 , Methylesterase CheB, C-terminal domain ( 1 ) 

41. Subtilisin-iike(l) 

3 layers: a/b/a, parallel beta-sheet of 7 strands, order 2314567; left-handed 
crossover connection between strands 2 & 3 
15 1. Subtilisin-like (2) 

L Subtilases(12) 

2. Serine-carboxy] proteinase PSCP (1) 
elaborated with additional structures 

42. Arginase/deacetylase (1) 

20 3 layers: a/b/a, parallel beta-sheet of 8 strands, order 21387456 

1 . Arginase/ deacetylase (2) 
L Arginase (2) 
2, Histone deacetylase, HDAC (1) 

43 . Co A-dependent acyltransferases ( 1 ) 

25 core: 2 layers, a/b; mixed beta-sheet of 6 strands, order 324561; strands 3 & 

6 are antiparallel to the rest 
1 . Co A-dependent acyltransferases ( 1 ) 

1 . CoA-dependent acyltransferases (5) 

44. Phosphotyrosine protein phosphatases I-Uke (2) 

30 3 layers: a/b/a; parallel beta-sheet of 4 strands, order 2134 
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TABLE 1 (continued) 

1 . Phosphotyrosine protein phosphatases 1(1) 

share the common active site structure with the family II 
1 . Low-molecular-weight phosphotyrosine protein phosphatases 
(3) 

2. Enzyme IIB-cellobiose (1) 

1 . Enzyme IIB-cellobiose ( 1 ) 
(Phosphotyrosine protein) phosphatases II (1) 
core: 3 layers, a/b/a; parallel beta-sheet of 4 strands, order 1432 
1 . (Phosphotyrosine protein) phosphatases 11 (3) 

share with the family I the common active site structure with a 

circularly permuted topology 

1 . Dual-specificity phosphatases (2) 

2. Higher-molecular-weight phosphot>Tosine protein phosphatases 
(8) 

have an extension to the beta-sheet of 3 antiparallel strands 
before strand 4 

3. Phoshphoinositide phosphatase Pten (Pten tumor suppressor), 
N-teraiinal domain (1) 

Rhodanese/Cell cycle control phosphatase (1) 

3 layers: a/b/a; parallel beta-sheet of 5 strands, order 32451 

1 , Rhodanese/Cell cycle control phosphatase (2) 

the active site structure is similar to those of the families I and II 
protein phosphatases; the topology can be related by a different 
circular permutation to the family I topology 
L Cell cycle control phosphatase, catalytic domain (2) 
2. Sulfurtransferase (rhodanese) (2) 

duplication, consists of two domains of this fold 

Thioredoxin fold (3) 

core: 3 layers, a/b/a; mixed beta-sheet of 4 strands, order 4312; strand 3 is 
antiparallel to the rest 

49 



TABLE 1 (continued) 

1 . Thioredoxin-like (10) 

1 . Thioltransferase (12) 

2. PDI-like (3) 

duplication: contains two tandem repeats of this fold 

3. Calsequestrin (1) 

duplication: contains three tandem repeats of this fold 

4. Disulphide-bond formation facilitator (DSBA) (2) 

5. Glutathione S-transferases, N-terminal domain (23) 

6. Phosducin (2) 

7. Endoplasmic reticulum protein ERP29, N-domain (1) 

8. spliceosomal protein U5-15Kd (1) 

9. Disulfide bond isomerase, DsbC, C-terminal domain (1) 
elaborated common fold 

1 0 . Glutathione peroxidase-like (6) 

2. RNA 3'-temiinal phosphate cyclase, RPTC, insert domain (1) 

1 . RNA 3'-tenninal phosphate cyclase, RPTC, insert domain (1) 
. 3. Thioredoxin-like 2Fe-2S ferredoxin (1) 

1 . Thioredoxin-like 2Fe-2S ferredoxin ( 1 ) 
Transketolase C-terminal domain-like (1) 

3 layers: a/b/a; mixed beta-sheet of 5 strands, order 13245, strand 1 is 
antiparallel to the rest 

1 . Transketolase C-terminal domain-like (3) 

1 . Transketolase ( 1 ) 

2. Branched-chain alpha-keto acid dehydrogenase beta-subunit, 
C-domain (2) 

3. Pyruvate-ferredoxin oxidoreductase, PFOR, domain 11 (1) 
Pyruvate kinase C-terminal domain-Uke (2) 

3 layers: a/b/a; mixed beta-sheet of 5 strands, order 32145, strand 5 is 
antiparallel to the rest 

1 . Pyruvate kinase, C-terminal domain (1) 
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TABLE 1 (continued) 

1 . Pyruvate kinase, C-terminal domain (5) 
2 . ATP syntase (F 1 - ATPase), gamnrxa subunit ( 1 ) 

contains an antiparallel coiled coil formed by - anh C-terminal 

extensions to the common fold 

1 . ATP syntase (Fl -ATPase), gamma subunit (2) 
Leucine aminopeptidase, N-tenninal domain (1) 

3 layers: a/b/a; mixed beta-sheet of 5 strands, order 23145; strand 2 is 
antiparallel to the rest 

1 . Leucine aminopeptidase, N-terminal domain (1) 

1 . Leucine aminopeptidase, N-terminal domain ( 1 ) 
Anticodon-binding domain-like (4) 

3 layers: a/b/a; mixed beta-sheet of five strands, order 21345; strand 4 is 
antiparallel to the rest 

1 . Anticodon-binding domain of Class n aaRS ( 1 ) 

1 . Anticodon-binding domain of Class 11 aaRS (5 ) 

2. TolB, N-terminal domain (1) 

1 . TolB, N-terminal domain (1) 

3. Diol dehydratase, beta subunit (1) 

1 . Diol dehydratase, beta subunit (1) 

contains additional structures in the C-terminal extension 

4. Maf/Haml (2) 

elaborated with additional structures inserted in the common fold 

1. Haml (1) 

2. Maf protein (1) 
Restriction endonuclease-like (3) 

core: 3 layers, a/b/a; mixed beta-sheet of 5 strands, order 12345; strands 2 c&, 
in some families, 5 are antiparallel to the rest 
1 . Restriction endonuclease-like (1 7) 

1 . Restriction endonuclease EcoRI (1 ) 

2, Restriction endonuclease EcoRV (1) 
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TABLE 1 (continued) 

3. Restriction endonuclease BamHI (1) 

4. Restriction endonuclease Bgll (1) 

5. Restriction endonuclease Bglll (1) 

6. Restriction endonuclease PvuII (1) 
5 7. Restriction endonuclease CM 01 (1) 

8. Restriction endonuclease Muni (1) 

9. Restriction endonuclease Nael (1) 

10. Restriction endonuclease NgoIV (1) 

1 1 . Restriction endonuclease BsobI (1) 

10 12. Restriction endonuclease Fokl, C-terminal (catalytic) domain 

(1) 

1 3 . lambda exonuclease ( 1 ) 

1 4. DNA mismatch repair protein MutH from ( 1 ) 

1 5 . Ver\^ short patch repair (VSR) endonuclease ( 1 ) 
15 16. TnsA endonuclease, N-terminal domain (1) 

1 7. HoUiday junction resolvase (Endonuclease I) (1) 

2. tRNA splicing endonuclease, C-terminal domain (1) 

1 . tRNA splicing endonuclease, C-terminal domain (1 ) 

3. Eukaryotic RPB5 N-terminal domain (1) 

20 1 . Eukaryotic RPB5 N-terminal domain (1) 

53. Resolvase-like (2) 

Core: 3 layers: a/b/a; mixed beta-sheet of 5 strands, order 21345; strand 5 is 
antiparallel to the rest 

1 . Resolvase-like (2) 

25 1 . gamma, delta resolvase, large fragment (1) 

2. 5' to 3' exonuclease (5) 

contains additional strand and alpha-helical arch; strand 
order 321456; strand 6 is antiparallel to the rest 

2. beta-carbonic anhydrase (1) 

30 1. beta-carbonic anhydrase (2) 
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IIA domain of mannose transporter, IIA-Man (1) 

3 layers: a/h/a; mixed beta-sheet of 5 strands, order 21345; strand 5 is 
antiparallel to the rest 

1 . HA domain of mannose transporter, IIA-Man (1) 
active dimer is formed by strand 5 swapping 
1. HA domain of mannose transporter, IIA-Man (1) 

Ribonuclease H-Iike motif (7) 

3 layers: a/h/a; mixed beta-sheet of 5 strands, order 32145; strand 2 is 
antiparallel to the rest 

1 . Actin-like ATPase domain (4) 
duplication contains two domains of this fold 

1. Actin/HSP70 (8) 

2 . Acetate kinase ( 1 ) 

3. Hexokinase (3) 

4. Glycerol kinase ( 1 ) 

2. Creatinase/prolidase N-terminal domain (1 ) 

1 . Creatinase/prolidase N-terminal domain (2) 

3. Ribonuclease H-like (6) 
consists of one domain of this fold 

1 . Ribonuclease H (4) 

2. Retroviral integrase, catalytic domain (3) 

3. mu transposase, core domain (1) 

4. Transposase inhibitor (Tn5 transposase) (1) 

5. DnaQ-like 3'-5' exonuclease (11) 

6. RuvC resolvase (1) 

A, Translational machinery components (2) 

1 . Ribosomal protein LI 8 and S 1 1 (2) 

2. Middle domain of eukaryotic peptide chain release factor 
subunit 1,ERF1 (1) 

5. Hypothetical protein MTHl 175 (1) 
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TABLE 1 (continued) 

1 . Hypothetical protein MTH 1175(1) 

6. DNA repair protein MutS, domain II (1 ) 

1 . DNA repair protein MutS, domain II (2) 

7. Methylated DNA-protein cysteine methyhransferase domain (1) 

1 . Methylated DNA-protein cysteine methyhransferase domain (3) 
Phosphorylase/hydrolase-like (6) 

core: 3 layers, a/b/a ; mixed sheet of 5 strands: order 21354: strand 4 is 
antiparallel to the rest; contains crossover loops 

1 . Hydrogenase maturating endopeptidase HybD (1) 
the fold coincides with the consensus core structure 

1 . Hydrogenase maturating endopeptidase HybD ( 1 ) 

2. Purine and uridine phosphorylases (1) 

complex architecture; contains mixed beta-sheet of 8 strands, order 
23415867, strands 3, 6 &7 are antiparallel to the rest; and barrel, 
closed; n=5, S=8 

1 . Purine and uridine phosphorylases (6) 

3. Peptidyl-tRNA hydrolase (1) 

1 , Peptidyl-tRNA hydrolase (1) 

4. Pyrrolidone carboxyl peptidase (pyroglutamate aminopeptidase) (1) 
1 . Pyrrolidone carboxyl peptidase (pyroglutamate 

aminopeptidase) (2) 

5 . Zn-dependent exopeptidases (5) 

core: mixed beta-sheet of 8 strands, order 12435867; strands 2, 6 &7 
are antiparallel to the rest 

1 . Pancreatic carboxypeptidases (6) 

2. Carboxypeptidase T (1) 

3. Leucine aminopeptidase, C-terminal domain (1) 

4. Bacterial exopeptidases (3) 

5. Transferrin receptor ectodomain, protease-like domain (1) 

6. LigB subunit of an aromatic-ring-opening dioxygenase LigAB (1) 
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TABLE 1 (continued) 

circular permutation of the common fold, most similar to the PNP fold 
1 . LigB subunit of an aromatic-ring-opening dioxygenase LigAB 
(1) 

Molybdenumm cofactor biosynthesis protein MogA (1) 
3 layers: a/b/a; mixed beta-sheet of 5 strands; order: 21354, strand 5 is 
antiparallel to the rest; permutation of the Phosphorylase/hydrolase-likefold 
1 . Molybdenumm cofactor biosynthesis protein MogA ( 1 ) 

1 , Molybdenumm cofactor biosynthesis protein MogA ( 1 ) 
Amino acid dehydrogenase-like, N-terminal domain (1) 

3 layers: a/b/a; mixed beta-sheet of 5 strands; 12435, strand 2 is antiparallel 
to the rest 

1 . Amino acid dehydrogenase-like, N-terminal domain (3) 

1 . Amino acid dehydrogenases (7) 
dimerisation domain 

2. Tetrahydrofolate dehydrogenase/cyclohydrolase (3) 

3 . Mitochondrial NAD(P)-dependent malic enz>ine ( 1 ) 

this domain is decorated with additional structures; includes 
N-terminal additional subdomains 
Glutamate ligase domain (1) 

3 layers: a/b/a; mixed beta-sheet of 6 strands, order 126345; strand 1 is 

antiparallel to the rest 

1 . Glutamate ligase domain (2) 

1 . MurD/MurF C-terminal domain (2) 

2. Folylpolyglutamate synthetase, C-terminal domain (1) 
Phosphoglycerate mutase-like (1) 

core: 3 layers, a/b/a; mixed beta-sheet of 6 strands, order 324156; strand 5 is 

antiparallel to the rest 

1 . Phosphoglycerate mutase-like (4) 

1 . Phosphoglycerate mutase ( 1 ) 

2. Acid phosphatase (2) 
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TABLE 1 (continued) 

3 . Phytase (myo-inositol-hexakisphosphate-3-phosphohydrolase) 
(3) 

4. 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase, 
phosphatase domain (1) 

PRTase-hke(l) 

core: 3 layers, a/b/a; mixed beta-sheet of 6 strands, order 321456; strand 3 is 
antiparallel to the rest 
L PRTase-like (2) 

1 . Phosphoribosyltransferases (PRTases) (14) 

2. Phosphoribosylpyrophosphate synthetase (1) 
duplication: consists of two domains of this fold 

Integrin A (or I) domain (1) 

core: 3 layers, a/b/a; mixed beta-sheet of 6 strands, order 321456; strand 3 is 

antiparallel to the rest 

1 . Integrin A (or I) domain (1) 

1 . Integrin A (or I) domain (7) 
Glutaconate-CoA transferase subunits (1) 

core: 3 layers: a/b/a; parallel or mixed b-sheet of 6 strands, order 432 J 56; 
part of sheet is folded upon itself and forms a barrel-like structure 
1 . Glutaconate-Co A transferase subunits ( 1 ) 

1 . Glutaconate-CoA transferase subunits (2) 
Pyruvate-ferredoxin oxidoreductase, PFOR, domain HI (1) 
3 layers: a/b/a; mixed beta-sheet of 6 strands, order 231456; strand 3 is 
antiparallel to the rest 

1 . Pyruvate-ferredoxin oxidoreductase, PFOR, domain in ( 1 ) 

I . Pyruvate-ferredoxin oxidoreductase, PFOR, domain III (1 ) 
Formyltransferase (1) 

3 layers: a/b/a; mixed beta-sheet of 7 strands, order 3214567; strand 6 is 

antiparallel to the rest 

1 . Formyltransferase ( 1 ) 
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TABLE 1 (continued) 

1 . Formyltransferase (2) 
S-adenosyl-L-methionine-dependent methyl transferases (1) 
core: 3 layers, a/h/a; mixed beta-sheet of 7 strands ^ order 3214576; strand 7 
is antiparallel to the rest 

1 . S-adenosyl-L-methionine-dependent methyltransferases (11) 

1 . Catechol O-methyltransferase, COMT ( 1 ) 

2 . RNA methyltransferase Fts J ( 1 ) 

3 . Fibrillarin homologue ( 1 ) 

4. Hypothetical protein MJ0882 ( 1 ) 

5 . Glycine N-methyltransferase ( 1 ) 

6. Arginine methyltransferase, HMTl (1) 

lacks the last two strands of the common fold replaced with a 
beta-sandwich oligomerisation subdomain 

7. Protein-L-isoaspartate O-methyltransferase (1 ) 
another C- terminal variation of the common fold with 
additional alpha+beta subdomain 

8. Chemotaxis receptor methyltransferase CheR, C-terminal 
domain (1) 

contains additional N~terminal all-alpha domain, res, 11-91 

9. RNA methylases (3) 

10. DNA methylases (5) 

1 1 . Type n DNA methylase (2) 

circularly permuted version of the common fold 
PLP-dependent transferases (1) 

main domain: 3 layers: a/b/a, mixed beta-sheet of 7 strands, order 3245671; 
strand 7 is antiparallel to the rest 
1 . PLP-dependent transferases (5) 
L AAT-like(9) 

2. Beta-eliminating lyases (2) 

3. Cystathionine synthase-like (8) 
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TABLE I (continued) 

4. omega- Amino acid:pyruvate aminotransferase-like (15) 

5 . Ornithine decarboxylase major domain ( 1 ) 
Nucleotide-diphospho-sugar transferases (1) 

3 layers: a/b/a; mixed beta-sheet of 7 strands, order 3214657; strand 6 is 
antiparallel to the rest 

1 . Nucleotide-diphospho-sugar transferases (8) 

1 . Spore coat polysaccharide biosynthesis protein SpsA (1) 

2. beta 1,4 galactosyltransferase (b4GalTl) (1) 

3. CMP acylneuraminate synthetase (1 ) 

4. Galactosyltransferase LgtC (1) 

5. N-acetylglucosamine 1 -phosphate uridyltransferase GlmU, 
N-terminal domain (1) 

6 . glucose- 1 -phosphate thymidylyltransferase RmlA ( 1 ) 

7. 1,3-Glucuronyltransferase I (glcAT-I) (1) 

8. Molybdenum cofactor biosynthesis protein MobA (1) 
alpha/beta-Hydrolases (1) 

core: 3 layers, a/b/a; mixed beta-sheet of 8 strands, order 12435678, strand 2 

is antiparallel to the rest 

1. alpha^eta-Hyd^olases (20) 

many members have left-handed crossover connection between strand 

8 and additional strand 9 



1. 


Acetylcholinesterase-like (8) 


2. 


Carboxylesterase (2) 


3. 


Mycobacterial antigens (2) 


4. 


Prolyl oligopeptidase, C-terminal domain (1) 


5. 


Serine carboxypeptidase (4) 


6. 


Gastric lipase (1) 


7. 


Proline iminopeptidase (2) 


8. 


Haloalkane dehalogenase (3) 


9. 


Dienelactone hydrolase (2) 
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16. 


Lipase (1) 


17. 


Fungal lipases (9) 


18. 


Bacterial lipase (5) 


19. 


Pancreatic lipase, N-terminal domain (6) 


20. 


Hydroxynitrile lyase (2) 



Nucleoside hydrolase (1) 

core: 3 layers, a/b/a ; mixed beta-sheet of 8 strands, order 32145687; strand 7 

is antiparallel to the rest 

1 . Nucleoside hydrolase ( 1 ) 

1 . Nucleoside hydrolase (2) 
Dihydro folate reductases (1) 

3 layers: a/b/a; mixed beta-sheet of 8 strands, order 34251687; strand 8 is 

antiparallel to the rest 

1 . Dihydrofolate reductases ( 1 ) 

1 . Dihydrofolate reductases (10) 
Ribokinase-like (2) 

core: 3 layers: a/b/a; mixed beta-sheet of 8 strands, order 21345678, strand 7 
is antiparallel to the rest 

potential superfamily: members of this fold have similar functions but different 

ATP-binding sites 

1 . Ribokinase-like (2) 

has extra strand located between strands 2 and 3 

1 . Ribokinase-like (3) 

2. Hydroxyethylthiazole kinase (thz kinase) (1) 
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TABLE 1 (continued) 

2. MurD-like peptide ligases, catalytic domain (2) 
has extra strand located between strands 1 and 2 

1. MurD/MurF(2) 

2. Folylpolyglutamate synthetase (1) 
Carbamate kinase-like (1) 

3 layers: a/b/a; mixed (mainly parallel) beta-sheet of 8 strands, order 
34215786; strand 8 is antiparallel to the rest 
1 . Carbamate kinase-like (1) 

topologically similar to the N-terminal domain of phosphoglycerate 

kinase 

1 . Carbamate kinase-like (2) 
Class II aldolase (1) 

3 layers: a/b/a; mixed (mostly antiparallel) beta-sheet of 9 strands, order 
432159876; left-handed crossover between strands 4 and 5 
1. Class n aldolase (1) " 

L Class II aldolase (1) 

metal (zinc)-ion dependent 
Cytosolic phospholipase A2 catalytic domain (1) 

3 layers: a/b/a; mixed beta-sheet of 9 strands, order 654321789; strands 4, 6 
and 8 are antiparallel to the rest 

1 . Cytosolic phospholipase A2 catalytic domain (1) 

1 , Cytosolic phospholipase A2 catalytic domain (1) 
Phosphatase/sulphatase (1) 

3 layers: a/b/a; mixed beta-sheet of 10 strands, order 56437 1892A, (A~10) 
strand 9 is antiparallel to the rest 
1 . Phosphatase/sulphatase (2) 

1 . Alkaline phosphatase ( 1 ) 

2. Arylsulfatase (2) 

Isocitrate & isopropylmalate dehydrogenases (1) 

consists of two intertwined (sub)domains related by pseudodyad; duplication 
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TABLE 1 (continued) 

3 layers: a/b/a; single mixed beta-sheet of 10 strands, order 213A945867 
(A=10); strands from 5 to 9 are antiparallel to the rest 
1. Isocitrate & isopropylmalate dehydrogenases (1) 

1 . Isocitrate & isopropylmalate dehydrogenases (7) 
ATC-hke (2) 

consists of two similar domains related by pseudodyad; duplication 
core: 3 layers, a/b/a, parallel beta-sheet of 4 strands, order 2134 

1 . Aspartate/ ornithine carbamoyltransferase ( 1 ) 

1 . Aspartate/ornithine carbamoyltransferase (6) 

2 . Glutamate racemase ( 1 ) 

1 . Glutamate racemase ( 1 ) 

C-terminal extension is added to the N-terminal domain 
Tryptophan synthase beta subunit-like PLP-dependent enzymes (1) 
consists of two similar domains related by pseudodyad; duplication 
core: 3 layers, a/b/a; parallel beta-sheet of 4 strands, order 3214 
1. Tryptophan synthase beta subumt-Uke PLP-dependent enzymes (1) 

1 . Tryptophan synthase beta subunit-like PLP-dependent enzymes 
(4) 

SIS domain (1) 

consists of two similar domains related by pseudodyad; duplication 
3 layers: a/b/a; parallel beta-sheet of 5 strands, order 21345 
L SIS domain (2) 

1 , "Isomerase domain" of glucosamine 6-phosphate synthase 
(GLMS) (1) 

2. Phosphoglucose isomerase, PGI (2) 
permutation of the superfamily fold 

Formate dehydrogenase/DMSO reductase, domains 1-3 (1) 
contains of two similar intertwined domains related by pseudodyad; 
duplication 

core: 3 layers: a/b/a; parallel beta-sheet of 5 strands, order 32451 
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TABLE 1 (continued) 

1. Formate dehydrogenase/DMSO reductase, domains 1-3 (1) 
molybdopterine enzyme 

1 . Formate dehydrogenase/DMSO reductase, domains 1-3 (6) 

domain 1 (residues 1-55) binds Fe4S4 cluster in FDH but not 
DMSO reductase 
Aldehyde reductase (dehydrogenase), ALDH (1) 

consists of two similar domains with 3 layers (a/b/a) each; duplication 

core: parallel beta-sheet of 5 strands, order 32145 

1 . Aldehyde reductase (dehydrogenase), ALDH (1) 

binds NAD differently from other NAD (P) -dependent oxidoreductases 
1 . Aldehyde reductase (dehydrogenase), ALDH (8) 

Aconitase, first 3 domains (1) 

consists of three similar domains with 3 layers (a/b/a) each; duplication 
core: parallel beta-sheet of 5 strands, order 32145 
1. Aconitase, first 3 domains (1) 

1 . Aconitase, first 3 domains (2) 
contains Fe(4)-S(4) cluster 
Phosphoglucomutase, first 3 domains (1) 

consists of three similar domains with 3 layers (a/b/a) each; duplication 
core: mixed beta-sheet of 4 strands, order 2134, strand 4 is antiparallel to the 
rest 

\, Phosphoglucomutase, first 3 domains (1) 

1. Phosphoglucomutase, first 3 domains (1) 
L-fiicose isomerase, N-terminal and second domains (1) 
consists of two domains of similar topology, 3 layers (a/b/a) each 
Domain 1 (1-173) has parallel beta-sheet of 5 strands, order 21345 
Domain 2 (174-355) has parallel beta-sheet of 4 strands, order 2134 
1 , L-fiicose isomerase, N-terminal and second domains (1) 

L L-fiicose isomerase, N-terminal and second domains (1) 
Phosphoglycerate kinase (1) 
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TABLE 1 (continued) 

consists of two non-similar domains, 3 layers (a/b/a) each 
Domain 1 has parallel beta-sheet of 6 strands, order 342156 
Domain 2 has parallel beta-sheet of 6 strands, order 321456 
1 . Phosphoglycerate kinase ( 1 ) 

1 . Phosphoglycerate kinase (4) 
Domain 2 binds ATP 
UDP-Glycosyltransferase/glycogen phosphorylase (1) 
consists of two non-similar domains with 3 layers (a/b/a) each 
domain 1: parallel beta-sheet of 7 strands, order 3214567 
domain 2: parallel beta-sheet of 6 strands, order 321456 
1 . UDP-Glycosyltransferase/ glycogen phosphorylase (4) 

1 . beta-Glucosyltransferase (DNA-modifying) ( 1 ) 

2. Peptidoglycan biosynthesys glycosyltransferase MurG (1) 

3 . XJDP-N-acetylglucosamine 2-epimcrase ( 1 ) 

4. Oligosaccharide phosphorylase (4) 
Glutaminase/Asparaginase (1) 

consists of two non-similar alpha/beta domains, 3 layers (a/b/a) each 
Domain 1 has mixed beta-sheet of 6 strands, order 213456, strand 6 is 
antiparallel to the rest; left-handed crossover connection between strands 4 
and 5 

Domain 2 has parallel beta-sheet of 4 strands, order 1234 
1 . Glutaminase/Asparaginase ( 1 ) 

1 . Glutaminase/Asparaginase (5) 
Phosphofructokinase (1) 

consists of two non-similar domains, 3 layers (a/b/a) each 

Domain 1 has mixed sheet of 7 strands, order 3214567; strands 3 & 7 are 

antiparallel to the rest 

Domain 2 has parallel sheet of 4 strands, order 2314 
1. Phosphofinctokinase (1) 

1 . Phosphofructokinase (2) 
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TABLE 1 (continued) 

Domain I binds ATP 
Cobalt precorrin-4 methyltransferase CbiF (1) 
consists of two non-similar domains 

Domain 1 has antiparallel sheet of 5 strands, order 32415 

Domain 2 has mixed sheet of 5 strands, order 12534; strands 4 & 5 are 

antiparallel to the rest 

1 . Cobalt precorrin-4 methyltransferase CbiF ( 1 ) 

1 . Cobalt precorrin-4 methyltransferase CbiF ( 1 ) 
Phosphoenolpyruvate carboxykinase (ATP-oxaloacetate carboxy-liase) (1) 
consists of two alpha/beta domains 

duplication: the domains share an unusual fold of 2 helices and 6-stranded 
mixed sheet; beta(2)-alpha'beta(4)-alpha; order 312465, strands 1 and 5 are 
antiparallel to the rest 

1 . Phosphoenolpyruvate carboxykinase (ATE-oxaloacetate carboxy-liase) 
(1) 

domain 2 contains the P-loop ATP-binding motif 
1 . Phosphoenolpyruvate carboxykinase (ATP-oxaloacetate carboxy-liase) 
(1) 

Chelatase-like (2) 

duplication: tandem repeat of two domains; 3 layers (a/b/a); parallel 
beta-sheet of 4 strands, order 2134 

1. Chelatase(2) 

interdomain linker is short; swapping of C-terminal helices between 
the two domains 

1 . Ferrochelatase ( 1 ) 

2. Cobalt chelatase CbiK (1) 

2. "Helical backbone" metal receptor (3) 

contains a long alpha helical insertion in the interdomain linker 

1 . Periplasmic ferric siderophore binding protein FhuD (1) 

2. TroA-like (2) 
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TABLE 1 (continued) 

3. Nitrogenase iron-molybdenum protein (3) 

contains three domains of this fold; ''Helical backbone'' holds 
domains 2 and 3 
Periplasmic binding protein-like 1(1) 

consists of two similar intertwined domain with 3 layers (a/b/a) each: 
duplication 

parallel beta-sheet of 6 strands, order 213456 
1 . Periplasmic binding protein-like I (1) 

Similar in architecture to the superfamily 11 but partly differs in 

topology 

1 . L-arabinose binding protein-like (13) 
Periplasmic binding protein-like II (1) 

consists of two similar intertwined domain with 3 layers (a/b/a) each: 
duplication 

mixed beta-sheet of 5 strands, order 21354; strand 5 is antiparallel to the rest 
1 , Periplasmic binding protein-like n (2) 

Similar in architecture to the superfamily I but partly differs in 

topology 

1 . Phosphate binding protein-like (20) 

2. Transferrin (8) 

further duplication: composed of two two-domain lobes 
Thiolase-like (1) 

consists of two similar domains related by pseudodyad; duplication 

3 layers: a/b/a; mixed beta-sheet of 5 strands, order 32451; strands 1 & 5 are 

antiparallel to the rest 

1. Thiolase-like (2) 

1 . Thiolase-related (6) 

2, Chalcone synthase (2) 
Fe-only hydrogenase ( 1 ) 

consist of two intertwined domains; contains partial duplication 
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TABLE 1 (continued) 

1 . Fe-only hydro genase ( 1 ) 

1 . Fe-only hydrogenase (2) 
Cytidine deaminase (1) 

consists of two very similar domains with 3 layers (a/b/a)each; duplication 
mixed beta-sheet of 4 strands, order 2134; strand 2 is antiparallel to the rest 
1 . Cytidine deaminase ( 1 ) 

1 . Cytidine deaminase ( 1 ) 
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TABLE 2 
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Cmpd E 
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TABLE 2 (contM.) 



CmpdF 



Cmpd G 



Cmpd H 




Cmpd I 



TABLE 2 (cont'd.) 




Cmpd N 



69 



TABLE 2 (contM.) 




CmpdS 
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CmpdT 



CmpdU 



CmpdV 



Cmpd W 







CmpdX 
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TABLE 2 (cont'd.) 




Cmpd AA 




Cmpd AC 
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TABLE 2 (cont'd,) 




Cmpd AH 
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TABLE 2 (cont' .d) 




TABLE 2 (cont'd.) 



Cmpd AS 
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TABLE 2 (cont'd.) 




78 




79 



TABLE 2 (cont'd- ) 




Cmpd AAG 
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The present invention is illustrated by the following examples. 



Example 1 

5 Identification of Alpha/Beta Proteins and AUosteric Regulatory Sites 

The present invention also provides methods of identifying a molecule 
which IS not LFA-1 or an I domain containing fragment thereof, said molecule 
comprising an a/p domain structure, said a/p structure comprising an allosteric 
regulatory site. When said molecule is contacted with an allosteric effector moiecule, 

10 allosteric regulatory sites such as, for example, I domain allostenc sites, interact with 

said allosteric effector molecule to promote a conformation in a ligand binding 
domain of said a/p structure that modulates binding between the first molecule and a 
binding partner molecule thereof, 

Allosteric regulatory sites can be identified, for example, by comparing 

15 candidate proteins to proteins having known allosteric regulatory sites. For example, 

a/p proteins having allosteric regulatory sites may be identified by using search tools, 
such as a NCBI vector alignment search tool (or ''VAST" search), which are able to 
identify proteins similar to a predetermined three dimensional structure [Gibrat et al, 
Curr. Opin. StrucL Biol, 6:377-385 (1996)], incorporated by reference herein in its 

20 entirety; and, Madej et al. Proteins 23:356-369 (1995), incorporated by reference 

herein in its entirety]. With respect to these methods, LFA-1 can be used as a 
compaiison or query protein because LFA-1 is known to include an I domain 
allosteric site. Similarly, other a/p proteins knovm to comprise an allosteric site can 
be used as a reference to identify other a/p proteins comprising an I domain allosteric 

25 site. In one embodiment, proteins with a VAST score of 7 or greater or a P value of 

0.005 or less may be defined as being sufficiently related to the comparison protein to 
warrant fiirther investigation. 

Allosteric regulatory sites may also be identified by using an algorithm 
that predicts conformational ambivalence [Young et aL, Protein Science 8:1752-1764 

30 (1999), incorporated by reference herein in its entirety; and, Kirshenbaum et al. 

Protein Science 8(9):1806-1815 (1999), incorporated by reference herein in its 
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entirety]. This algorithm, referred to as the Ambivalent Structure Predictor ("ASP"), 
predicts regions of three-dimensional conformational rearrangement from amino acid 
sequence information. The algorithm uses scaled probabilities from a secondary 
structural prediction algorithm, Profile Network Prediction Heidelberg ("PHD") 
[Rost, Meth, Enzymol. 266:525-539 (1996), incorporated by reference herein in its 
entirety], to identify structurally ambivalent sequence elements. Residues possessing 
a z score below -1 .75 standard deviations of the mean residue ambivalence score in 
a/p domains are understood as being consistent with an allosteric regulatory site of the 
type useful according to the present invention. 

For example. Table 3 shows that the integrin a/p domains and their 
close relatives possess a high VAST core of approximately 10 or greater and a P value 
of approximately 0.0009 or less relative to two representatives LFA-1 and Mac-1. 
Further, Table 3 indicates that the position of structurally ambivalent sequence 
elements (S ASE) is consistent with the known or predicted c-terminal rigid body 
motion for these domains. Accordingly, these and other closely related domains of 
this type are predicted to possess a typical IDAS. Moreover, as demonstrated by the 
calculations presented in Table 3, some Ras superfamily members such as RhoA and 
enzymes such as ENR are also predicted to possess a typical IDAS. 

Additionally, some non-integrin oc/p domains that are more distantly 
related, as demonstrated by VAST analysis, possess a SASE at a site that appears to 
be distinct from the typical integrin IDAS. These a/p domains may possess an IDAS- 
like site also capable of being modulated with a small molecule such as a diaryl 
compound. 

Many a/p domains share less than 35% amino acid identity. Therefore, 
a web-based simple modular architecture research tool, SMART, [see Schultz et al., 
Nuc. Acids Res,, 28:231-234 (2000), incorporated by reference herein in its entirety; 
Copley et al., Curr. Opin, Struct. Biol. 9:408-415 (1999), incorporated by reference 
herein in its entirety; Pouting et al, Nuc. Acids Res. 27:229-232 (1999), incorporated 
by reference herein in its entirety; and, Schultz et al, PNAS USA 95:5857-5864 
(1998), incorporated by reference herein in its entirety] that compares query sequences 
with its database of domain sequences has been used to identify additional divergent 
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family members. SMART utilizes multiple sequence alignments of representative 
family members. These alignments are optimized manually, and following the 
generation of a hidden Markov model, can be used to search sequence databases. 
Significantly similar sequences are added to the alignment, thereby refining the model 
which is used for subsequent searches. Accordingly, the SMART database may be 
used as a source of identifying additional a/p domains of interest to analyze for the 
presence of an allosteric regulatory site. 
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TABLE 3 



VAST Structure Neighbor 



ASP 




*SASE- Structurally ambivalent sequence element. 

t C TexmLal SASE not detected by ASP defeult settmgs. 

t Second site of SASE may represent IDAS-hke site. 

Example 2 
CDllb I Domain Mutants 

A. GeneratimofMutat^^ 

In view of previous results [Huth, et «/., Proc. Natl. Acad. Sci. (USA) 
97-5231-5236 (2000)] using CDl la variants with mutations in the I domain, 
n.utat.ons were introduced in CDl lb m an attempt to identify CDl lb variants with 
increased affinity for binding partners ICAM-land iC3b. 
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Six mutations were generated using a QuikChange Site-Directed 
Mutagenesis Kit (Stratagene). These mutants included single changes of Asp'^"^ 
(D156A), Val'^" (V254A), Gln'^' (Q327A), Ile^'' (I332A), Phe^^' (F333A) and Glu'^' 
(E336A) to Ala. Briefly, two mutagenic oligonucleotides (one to the sense strand and 
5 one to the antisense strand) were synthesized which were used in PGR with full-length 

GDI lb as template. The PGR conditions for mutants D156A, V254A, Q327A, and 
1332 A included 1 cycle at 95^C for 30 seconds followed by 16 cycles of for 30 
seconds, SO^'G for 1 minute and 60''G for 18 minutes. PGR conditions for mutants 
F333 A and E336A were the same except that the final elongation step was carried out 

10 at 68°G for 20 minutes in the 16 cycles. After the PGR was complete, the methylated, 

non-mutated template DNA was digested with Dpnl at 3TC for 1 hour and the 
mutagenized GDI lb DNA was used to transform Supercompetent XLl Blue Cells 
(Stratagene) according to the manufacturer's suggested protocol. Garbomycin 
resistant colonies were picked and grown in liquid culture, after which plasmid DNA 

15 was isolated and the insert was sequenced. From clones having full-length mutants, a 

1 .3 kb SacVEcoRY fragment containing the 5' portion of the gene was subcloned back 
into the parental vector. The inserts from these subclones were sequenced to verify 
the integrity of the junctions and the presence of the mutation. 



20 D 1 56A (sense) SEQ ID NO: 1 

GATTGGGTTGTTGATTGGGGGGTGTGGTAGGATG 
V254A (sense) SEQ ID NO: 2 

GGGTTTAAGATGGTAGGGGTGATGACGGATGGAG 
Q327A (sense) . SEQ ID NO* 3 

25 GAAGACGATTCAGAAGGGGCTTGGGGAGAAGATG 

I332A (sense) SEQ ID NO: 4 

GAGGTTGGGGAGAAGGGGTTTGGGATGGAGGG 
F333 A (sense) SEQ ID NO: 5 

CTTGGGGAGAAGATGGGGGGGATGGAGGGTAC 
30 E336A (sense) SEQ ID NO: 6 

GAAGATGTTTGCGATGGGGGGTAGTGAGAGAGG 

B. COS-7 Transfections 

GOS cells were co-transfected with CDlS/pDGl and either wild-type 
35 GDI lb or a mutant form of GDI lb. Transfections were performed essentially as 
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previously described [Huth, et al, Proc. Natl. Acad. Sci. (USA) 97:5231-5236 
(2000)]. 

C. FACS Analysis 

5 FACS analysis was carried out as previously described [Huth, et al, 

Proc. Natl. Acad. Sci. (USA) 97:5231-5236 (2000)] except that the anti-CDl lb 
monoclonal antibody TMG6-5 [Diamond, etal, J. Cell Biology 120:1031-1043 
(1993)] was used to confirm CDl lb expression. 

10 D. Adhesion Assay with COS Transfected Cells and Immobilized ICAM-1 or 
iC3b 

Adhesion assays were performed in 96-well'Easy Wash plates 
(Coming Glass, Coming, NY) using a modified procedure [Sadhu, et al. Cell Adhes. 
Commun, 2:429-440 (1994)]. Each well was coated overnight at 4T with 50 ^1 of 

15 glycophorin (Calbiochem) (10 fig/ml), ICAM-L^c (5 ^ig/ml), iC3b (3 fjig/ml) or with 

anti-CD18 monoclonal antibody (TSl/18, 5 p-g/ral) and anti-CDl lb monoclonal 
antibody (44AACB [ATCC], 5 ^g/ml) in 50 mlM bicarbonate buffer (pH 9.6), or 
buffer alone. Plates were washed twice with 200 jil/well D-PBS and blocked with 1% 
HSA (100 |il/well) in D-PBS for 1 hr at room temperature. Wells were rinsed once 

20 with 100 \i\ of adhesion buffer (containing RPMI and 5.0% inactivated FBS) and 100 
|il adhesion buffer was added to each well. Another 100 |il of adhesion buffer, with 
or without control antibody (IgG(5a)7:2, 60 iig/ml), blocking antibody (44AACB, 60 
|ig/ml) or activating antibody 240Q [Huth, et aL, Proc. Natl. Acad. Sci. (USA) 
97:5231-5236 (2000)] at 60 ^g/ml was added to each well, after which COS-7 

25 transfectants (100 yd of 0.75 x 10^ cells/ml) in adhesion buffer were added to each 

well. The plates were incubated at 37''C for 30 minutes for ICAM-1 binding or 15 
minutes for iC3b binding. Adherent cells were fixed by the addition of 50 |il/well 
14% glutaraldehyde in D-PBS and incubation continued at room temperature for 1,5 
hr. The plates were washed with dH20, stained with 100 jil/well 0.5% crystal violet 

30 in 10% ethanol for 5 minutes at room temperature, and washed in several changes of 

dH20. After washing, 70%o ethanol was added and adherent cells were quantitated by 
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determining absorbence at 570 nm and 410 nm using a SPECTRmax 250 microplate 
spectrophotometer system (Molecular Devices, Sunnyvale, CA). Percentage of cell 
binding was determined using the formula below. 

% of cell binding - A570 - A410(binding to ICAM-1 or iC3b) x 1 00 

A570-A410 (binding to CD18+CD1 lb monoclonal antibodies) 

Results indicated that wild type CDl lb binding to ICAM-1 and iC3b 
was 3.1% and 26.4%, respectively. Mutants V254A, Q327A, and 1332 A each 
demonstrated significantly higher binding to ICAM-1 (114.7%, 105,1%, and 123.1% 
of wildtype levels, respectively) and iC3b (147.1%, 140.5%, and 205.2%^, 
respectively), while mutants F332A and E336A showed significantly lower binding to 
both ICAM-1 (1.1% and 0.7%, respectively) and iC3b (4.9% and 4.3%, respectively). 
Mutants which demonstrate higher levels of ICAM-1 binding are therefore useful for 
identifying compounds that inhibit CD18/CD1 lb (Mac-1) binding to ICAM-1 in 
providing a higher signal-to-noise ratio as a result of the mcreased level of ICAM-1 
binding. 

Example 3 
Identification of CDllb Agonists 

Previous work has demonstrated that various diaryl compounds can 

inhibit LFA-1 binding to ICAM-1 . In view of this observation and the results in 

Example 1 above, experiments were designed to determine if diaryl compounds can 

affect CDl lb binding to natural binding partners, presumably through interaction with 

an allosteric regulatory region of CDl lb. 

A. Adhesion Assay of HL60 Expressing to Immobilized ICAM-1 

In order to assess the ability of the test compounds to modulate CDl lb 
(o:^ binding, adhesion assays were performed using HL60 cells and immobilized 
ICAM-1 . 

Assays were performed in the presence of blocking anti-CD 18 
monoclonal antibody (TSl/22, 10 jxg'ml) with 100 [il of HL60 cells (1 x 10^ cells/ml) 
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in adhesion buffer were performed in 96-well Easy Wash plates (Coming Glass, 
Coming, NY) using the procedure described above except that each well was coated 
overnight at 4^C with (i) 50 \i\ ICAM-l/Fc (5 ^ig/ml), (ii) anti-CD 18 monoclonal 
antibody (22F12C, 5 \xg/mD and anti-alpha 4 monoclonal (A4.1, 5 fig/ml) in 50 mM 
5 bicarbonate buffer (pH 9.6), or (iii) buffer alone. Percentage of cell binding was 

detemiined using the formula below. 



10 
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% Binding = A570 - A410fbinding to ICAM4) X 100 

A570-A410 (binding to CD18+CDlla mAb) 

Data was then normalized using the formula: 

% of DMSO binding = % of cell binding, inhibitors X 100 

% of cell binding, DMSO 

Approximately 30 compounds were identified for further study, IC50 
values were detemiined in the HL-60 assay described above or in a neutrophil binding 
assays with fibrinogen described below (Example 15). 



20 B, Adhesion Assay of JY/CDllb Cells to Immobilized iC3b 

Briefly, each well of a 96-weIl plate was coated overnight at 4°C with 
50 \i\ glycophorin (10 fig/ml), iC3b (5 fJ-g/ml) or with anti-CD18 monoclonal antibody 
(22F12C, 5 \ig/ml) and anti-CDl lb monoclonal antibody (44AACB, 5 |Jig/ml) in 
bicarbonate buffer (pH 9.6). Plates were blocked with human serum albumin in D- 
25 PBS for one hr at room temperature. JY cells transfected with CDl lb (JY/CDl lb 

cells) ( 100 jil at 1 X 10^ cells/ml) in adhesion buffer were added to each well and 
incubation was carried out at 37°C for 30 min. Plates were fixed and analyzed as 
described above in Example 1. Percentage of cells binding was determined using the 
equation below. 

30 

% Binding - f A570 - A41 0(binding to iC3b) X 100 

A570-A410 (binding to CD18 + CDllb mAbs) 
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Data was normalized using the formula: 

% of DMSO binding = % of cell binding, inhibitors X 100 

*% of cell binding, DMSO 

5 

IC50 values were determined for 45 compounds that demonstrated 
inhibition in the screen and six of these compounds showed IC50 of less than 10 [iM. 
Twelve of the 45 compounds were subsequently used in binding assays using 
neutrophil adhesion to fibrinogen (described in Example 15). 
10 This screen also identified 17 compounds with the ability to stimulate 

binding to iC3b. Re-titration of these 17 compounds revealed that Cmpd H, Cmpd I, 
and Cmpd C were capable of dose-dependent stimulation of CDl lb/CD18 binding to 
iC3b at a level two times that observed with control DMSO treatment. 

15 Example 4 

Screening for Inhibitors of Complement Protein C2 and Factor B 

Complement proteins C2 and Factor B have been shown to include A 

domain regions which are believed to regulate serine protease activity of the proteins 

and their respective convertases. The A domains in these proteins are also believed to 

20 serve as ligand binding sites and to include one or more regulatory domains. C2 binds 

complement protein C4b to form the C3 convertase and part of the C5 convertase in 
the classical complement pathway, and Factor B binds C3b to form the alternative 
complement pathway C3 convertase and part of the C5 convertase. Identification of 
modulators for C2 or Factor B binding would presumably provide a mechanism by 

25 which C3 and/or C5 convertase activity can be controlled. 

A screen for inhibitors of the classical pathway complement protein C2 
and alternative pathway complement protein Factor B includes primary screening 
using modifications of standard hemolytic CH50 and AH50 assays in a microtiter 
plate format as described below. [See also Current Protocols in Immunology, Chapter 

30 13, Unit 13.1, John Wiley & Sons, Lie, ( 2000).] The CH50 assay is dependent on 

the activity of the classical pathway and C2, whereas the AH50 assay is dependent on 
the activity of the alternative pathway and Factor B. 
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The CH50 assay consists of analysis of complement-dependent lysis of 
sheep red blood cells (RBCs) which have been opsonized with anti-sheep RBC serum 
and is dependent on both Mg'^ and Ca""^. The CH50 is the concentration of human 
serum necessary to cause the lysis of 50% of the opsonized sheep RBC within 1 hour 
at 37T;. The primary screen for C2 inhibitors includes use of a constant serum 
concentration at the CH50 level, and the assay is conducted in the presence and 
absence of 10 |aM of test compounds. Compounds that inhibit this primary assay are 
titrated and retested for specificity in a secondary hemolytic assay in which each 
individual purified complement protein is added sequentially in the presence or 
absence of the test compound to determine which component is being inhibited. 

The AH50 assay consists of analysis of the direct 
complement-dependent lysis of rabbit red blood cells and is dependent on Mg^ but 
not Ca"^, and therefore is performed in the presence of EGTA. Similar to the CH50, 
the AH50 is the concentration of human serum necessary to cause the lysis of 50% of 
the rabbit RBC within 1 hour at 37°C. The primary screen for Factor B inhibitors 
includes use of a serum concentration at the AH50 level, and the assay is conducted in 
the presence and absence of 10 p-M of test compounds. Compounds that inhibit this 
primary assay are titrated and retested for specificity in a secondary hemolytic assay in 
which each individual purified complement protein is added sequentially in the 
presence or absence of the compound to determine which component is being 
inhibited. 

Sheep whole blood in Alsevers solution and anti-sheep hemolysin were 
obtained fi-om Colorado Serum Co. (Denver, CO). Erythrocyte-antibody complexes 
(EA) were produced using an optimal concentration of anti-sheep hemolysin, 
determined by titration to be a 1 :800 dilution. Normal human serum (NHS) was 
generated by collecting firesh serum from 10 random healthy human donors, pooling 
it, aliquotting the pooled serum, flash fi^eezing it in Uquid nitrogen, and storing it at 
-70°C. A fi-esh aliquot was thawed immediately prior to each use. 

A standard assay was established in Costar 96-well round-bottom or V- 
bottom microtiter plates. All samples were analyzed in duplicate and averaged. First, 
the NHS was titrated to determine the midpoint of its Unear activity in lysing the EA 
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(the CH50 dilution). Serial two-fold dilutions of freshly thawed NHS in gelatin- 
veronal buffer with Mg^"^ and Ca^' (GVB^^ containing 0.142 M NaCL 4.9 mM sodium 
5, 5'-diethylbarbituric acid and LO g/1 gelatin, tlie pH adjusted to 7.35 with HCl, 
followed by addition of CaCl2 and MgCl2 to final concentrations of 60 |aM and 400 
5 laM, respectively), or dH20 (used to determine total lysis) were placed in duplicate 

wells (80 |il/well) and warmed to 3T'C for 5 minutes. EA which had been washed 
twice with GVB' ' and resuspended at 2 x 10^ complexes/ml were added (80 ^il/well) 
and the plate was incubated at 3TC for 60 minutes. Eighty jil/well of 0.15 M NaCl 
was added and the plate was centrifuged at 2500 rpm for 3 minutes. One hundred 

1 0 |i.l/well of supernatant was transferred from the assay plate to an hnmulon4 96-well 

flat-bottom ELIS A plate and the absorbance at 420mn was determined. Background 
readings of absorbance in the wells containing no NHS were subtracted from the 
reading for all wells containing NHS and the resulting specific absorbance was 
expressed as a percentage of that obtained from wells containing dH20 (% Total 

15 Lysis), 

The dilution of NHS necessary to give 50% Total Lysis in 60 minutes 
at 37°C (the CH50) was determined to be 1 :150. This dilution constituted the 
midpoint of the linear range of the NHS lytic activity and was used to screen the 
library of test compounds for inhibitors of the complement pathway. The test 

20 compounds were first diluted in GVB'^/5% DMSO to 40 |iM and aliquotted at 40 

|al/well in duplicate into Costar 96-well round-bottom or V-bottom plates. Control ' 
wells containing GVB^ (background), dH20 (total lysis), DMSO alone, anti-C2 
polyclonal antisera (40 |ig/ml; Calbiochem), normal goat IgG (40 [xg/ml; Sigma), and 
EGTA (4 mM) were also included. Plates were incubated at 3TC for five minutes. 

25 Forty |il/well of NHS diluted to 1 :75 in GVB^ was added (this created a 1 :150 final 

dilution with compound), except in backgroimd or total lysis wells which received 
GVB^ or dH20, respectively. Plates were incubated at 37°C for 10 minutes. EA were 
washed twice, resuspended at 2 x lOVml in GVE*^, and added to each plate at 80 
lil/weli. The plates were incubated at 37''C for 60-70 minutes, after which 80 |il/well 

30 of 0. 15 M NaCl was added and the plates were centrifiiged at 2500 rpm for 3 minutes. 

One hundred |j,1 of supernatant from each well was transferred from the assay plates to 
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separate wells on Immulon4 96-well flat-bottom ELISA plates and the absorbance at 
420nm was analyzed. Background readings of absorbance in the wells containing no 
NHS were subtracted from the absorbance for each well and the resulting specific 
absorbance was expressed as a percentage of that obtained from wells containing 
DMSO alone (% DMSO lysis). All compounds which inhibited DMSO lysis by 
greater than 35% were re- tested and titrated in the same assay. Thirty nine 
compounds were identified with IC50 values of less than or equal to 20 |iM. The two 
most potent compoimds had IC50 values of less than 5 |-lM, and were shown to be 
selective for complement inhibition since they did not sigjlificantly inhibit (i) LFA-1 
mediated adhesion to ICAM-1, (ii) Mac-1 mediated adhesion to ICAM-1, (iii) ajPi 
mediated adhesion to collagen, (iv) a^p^ mediated adhesion to MAdCAM-1, or (v) 
vWf binding to gplb in standard cell-based adhesion assays at concentrations greater 
than or equal to 20 |xM. 

Approximately 30% of the activity of serum in the classical 
complement pathway (CCP) screen is due to amplification by the alternative 
complement pathway (ACP) Factor B containing C3 and C5 convertases. Therefore, 
this assay has the potential to isolate inhibitors of either the classical complement 
pathway convertases, the lectin complement pathway (LCP) (in which C3 is an 
intermediate component as well), and the altemative complement pathway. It is also 
possible that given the high degree of primary sthictural homology between C2 and 
Factor B, compounds may be isolated which inhibit both convertases in all three 
pathways. 

Given the nature of the original screen, inhibition could have occurred 
at any stage of the complement pathway. In order to determine at which stage of 
complement activation the test compounds inhibited activity, purified complement 
proteins were obtained (Advanced Research Technologies, San Diego, CA) and 
complement activation was reconstituted in a stepwise manner. At each step, the lead 
compound or DMSO alone was added and the terminal hemolytic activity was 
measured as above. Initially, the lead compoimd was tested for its ability to inhibit at 
any of four different stages of complement activation: 1) CI binding to aggregated 
antibody on the surface of the EA; 2) C4 binding to and cleavage by CI ; 3) C2 
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binding to C4b, activation of C2 by CI -mediated cleavage and C4bC2a-mediated 
cleavage of C3 (i.e., formation and activity of the C3 convertase); and 4) formation 
and activity of the C5 convertase and subsequent deposition of complement proteins 
C6 through C9, which form the membrane attack complex (MAC) resulting in cell 
lysis. 

In the assay, 1 x 1 0 EA/well were analyzed in duplicate wells of 
Costar 96-well round-bottom plates. For testing stage 1 (as indicated above), cells 
were resuspended in GVB"^' containing 7.5 |ag/ml CI protein and incubated for 15 
minutes at SO'^C. For testing stage 2, cells were resuspended in GVB"^ contaming 7.5 
|ag/ml C4 protein and incubated for 15 minutes at 30°C. For testing stage 3, cells 
were resuspended in GVB"^ containing 0.4 |ig/ml C2 protein and 25 |ig/ml C3 protein 
and incubated for 30 minutes at 30°C. For testing stage 4, cells were resuspended in 
GVB'' containing 4 mM EGTA and a 1 :50 dilution of NHS and incubated for 60 
minutes at 37°C. For each stage, a titration of the lead compound was carried out 
wherein the dilutions of the compound with DMSG, goat anti-C2 pIgG, and goat 
normal pIgG were tested for inhibition. Each pair of wells received inhibitors at only 
one stage. After each stage's incubation period, plates were centrifuged at 2400 RPM 
for 3 minutes, and cell pellets were washed twice with 1 00 (il/well GVB^ to remove 
inhibitors and unbound protein. EGTA was used in stage 4 to block new addition of 
CI from the serum and therefore make the final stage dependent on previous 
deposition of C3b. In this component assay, anti-C2 pIgG but not normal pIgG, 
blocked complement activation at stage 3 as expected. 

The lead compound inhibited stage 4 in a dose-dependent manner but 
not stages 1, 2, or 3. These results indicated that the compound did not inhibit 
formation or activity of the CCP/LCP C3 convertase but inhibited either the C5 
convertase or subsequent formation of MAC, the terminal component of the 
complement system. 

In order to determine whether the lead compound inhibited the activity 
of the C5 convertase or subsequent formation of the MAC, a simplified component 
assay was carried out. C2-depleted NHS was obtained (Advanced Research 
Technologies, San Diego, CA). EA were washed twice with GVB^ and resuspended 
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at 2 X 10^ cells/ml in GW\ An equal volume of GVB"^^ containing a 1 :50 dilution 
of C2-depleted NHS was added and the cells were incubated at 30°C for 7.5 minutes 
to allow deposition and activation of CI, and subsequent cleavage of C4. The cell 
suspension was diluted 20-fold with GVB^^ to stop the reaction, and centrifuged 2400 
RPM for three minutes. The cell pellet was washed three times with GVB'"^ and 
resuspended at 2 x 10^ cells/ml in GVB^"-. Fifty ^il/well of the treated EA was added 
to duplicate wells of a Costar 96-well round-bottom plate, along with 50 [il/well of 
GVB'^ containing 1 |ig/ml C2, 50 ja&'ml C3, and 1 |ag/ml C5, with or without anti-C2 
(80 M-g/^ml) normal goat IgG (80 fxg/ml). Lead compound (80 \iM) or DMSO was 
added and the plate was incubated at 30°C for 20 minutes. Two hundred [il/well of 
GVB^^ was added, the plate was centrifuged 2400 RPM, 3 minutes, the supematants 
were aspirated, and the pellets washed once with 200 jil/well GVB"^ ' . The cell pellets 
were resuspended in 100 |il/well GVB and 100 |il/well GVB containing 40 mM 
EDTA and 1 :50 NHS was added, after which the plate was incubated at 37°C for 60 
minutes. The plate was centrifiiged again, and 100 [iVwell was transferred to an 
hnmulon4 96-well flat-bottom plate and absorbance determined at 420 nm. 

Both the anti-C2 pIgG and the lead compound specifically inhibited 
hemolysis, indicating that the compounds inhibit the CCP/LCP C5 convertase activity 
directly. These results were consistent with a potential mechanism of complement 
inhibition wherein the test compound bound C2 or Factor B and inhibited a 
conformational change necessary for the serine protease domain to gain access to the 
C5 substrate. Crystal stracture data of the Factor B serine protease domain and 
modeling of its interaction with the A domain is consistent with this hypothesis [Hua 
Jing, et al, EMBO J. 19:164-173 (2000)]. 

The top 5 inhibitors of complement proteins C2 and Factor B are 
shown in Table 4. 
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TABLE 4 
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Example 5 

Isolation of cDNAs for Alpha E, E-cadherin, and MAdCAM-1 

In order to assess whether it is possible to modulate binding activity of 
other a/p proteins, DNA encoding alpha E, E-cadherin and MAdCAM-1 were 
prepared as follows. 

A. Alpha E 

1. Isolation of human alpha-E cDNA 

DNA encoding human alpha-E was isolated jfrom a normal human 
intestinal cDNA library (Clontech Laboratories, Inc., Palo Alto, CA) using an alpha E 
I domain cDNA as a probe. The alpha E I domain probe was cloned by PGR 
amplification using a human colon cDNA library as template and primers 
encompassing the 5' and 3^ ends of the alpha E I domain. Li order to facilitate cloning, 
BamUl midXhol restriction sites (underlined in the sequence) were designed into the 
5' (SEQ ID NO: 7) and 3' (SEQ ID NO: 8) primers. 

ATT GGA TCC OCT GGC ACC GAG ATT GCC ATC SEQ ID NO: 7 

AAT TTCTCGAG GTC TCC AAC CGT GCC TTC C SEQ ID NO: 8 

A 607 bp I domain fragment was ampUfied, digested with BamHl and 
Xhol, and inserted into the plasmid pBluescript® SK (Stratagene, La Jolla, CA). The 
plasmid was transformed into bacteria, plasmid DNA was prepared according 
pubUshed procedures, and the BamHl/Xhol insert was purified. The fragment 
encoding the alpha E I domain was radiolabeled with ^^P-dCTP and ''P-dTTP using a 
random primed DNA labeling kit (Roche Diagnostics Corp., hidianapolis, IN) for use 
as a hybridization probe. 

DNA encoding full-length alpha E was identified as follows. A human 
intestinal cDNA library in phage lambda GTll (CLONTECH Laboratories, hic, Palo 
Alto, CA) was plated and hybridized with the I domain probe using standard 
procedures. From two rounds of screening, six phage clones were isolated. The 
cDNA inserts were isolated from the phage by EcoRI digestion, subcloned into 
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pBluescript® SK (Stratagene, La JoUa, CA), and sequenced. A complete 3.4 kb 
sequence was reconstituted from three different clones: clone A (3) encompassing the 
5' end, clone B (22) that included sequences from in the middle of the cDNA, and 
clone C (22) encompassing the 3' end of alpha E cDNA. Sequence analysis indicated 
5 that clone A (3) contained an insertion of two cytidines and another insertion of a 

guanine at positions 357 and 464, respectively, when compared to the published 
nucleotide sequence. These insertions resulted in a 75 base frameshift in the open 
reading frame which resulted in the addition of 25 additional amino acid residues, 
shown below, not found in the previously reported sequence. 

10 

PKGRHRGVTVVRSHHGVLICIQVLVRR SEQ ID NO: 9 

The sequences downstream from this 25 amino acid insertion were identical to the 
published alpha E sequence for the rest of the molecule. 
15 In order to subclone the alpha E cDNA into pcDNA3® (Invitrogen 

Corp., Carlsbad, CA), a Hindlil site was generated at the 5* end by PCR amplification 
using the 5' primer Eo26-H3 (SEQ ID NO: 10) and the 3' primer Eo-24 (SEQ ID NO: 
11) primers shown below. 

20 GAG GGG AAG CTT AGT GGG CC SEQ ID NO: 10 

GAA GTT GGC CTG AGC CTG G SEQ ID NO: 1 1 

The PCR product was digested with Hind ID and Nsil^ and ligated into the 
corresponding sites of the vector. 

25 The expression vectors pMHneo [Hahn et aL, Gene 127:267-268 

(1993)] and pcDNA3®/aE were transformed into the bacterial strain NEB316, a dam" 
strain which does not methylate Xbal restriction sites, and plasmid DNA isolated 
according to standard procedures. Both pMHneo and pcDNA3®/aE were digested 
with Hindm and Xbal and the 3.4 kb alpha E cDNA fragment from pcDNA3®/aE was 

30 separated using agarose gel electrophoresis. The fragment was excised from the gel, 

purified, and ligated into HindUUXbal'digestod vector pMHneo. An aliquot of 
ligation mixture was used to transform XL-1 Blue bacteria (Stratagene, La Jolla, CA) 
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according to the manufacturer's protocol, and bacterial colonies containing pMHneo 
were selected by growth on LBM agar plates containing ampicillin. Bacterial colonies 
were grown overnight in LBM media containing 1 00 ug/ml ampicillin and plasmid 
DNA was isolated using the Wizard Plus Miniprep Kit (Promega Corp., Madison^ 
5 WI)- The plasmid DNA was characterized by diagnostic restriction digestion and a 

plasmid containing the alpha E cDNA, referred to as pMHneo/aE, was used to stably 
transfect a JY cell line as described below. 

B. E-cadherin 
10 1. Isolation of E-cadherin cDNA 

The cDNA for human E-cadherin was isolated by PCR amplification 
of a Marathon-Ready™ human colon cDNA library (CLONTECH Laboratories, Inc. 
Palo Alto, Cahfomia) using E-cad 5W1 (SEQ ID NO: 12) and E-cad 3'#1 (SEQ ID 
NO: 13) primers, which are set forth below. 

15 

5'-CTGCCTCGCTCGGGCTCCCCGGCCA-3' SEQ ID NO: 12 

5'-CTGCACATGGTCTGGGCCGCCTCTCTC-3' SEQ ID NO: 1 3 

Polymerase chain reactions were performed in a Perkin Elmer Cetus (PE Applied 
20 Biosystems, Foster City, CA) DNA thermal cycler in a reaction mixture containing 5 

til of the library cDNA, 10 |al of 5X PCR buffer from an Advantage™-GC cDNA 
PCR Kit (CLONTECH Laboratories, hic. Palo Alto, California), 1 \xl of SOX dNTP 
mix, 1 ^1 of 10 \iM primer E-cad5'#l, 1 |j,l of 10 |iM primer E-cad 3'#1, 1 |al of 
Advantage™ KlenTaq polymerase mix, and 31 p,l of H2O. AmpUfication conditions 
25 included an initial incubation for I min at 94*^0, followed by 5 cycles at 94°C for 30 

sec and 72°C for 4 min; 5 cycles at 94^C for 30 sec and 70*^0 for 4 min; 25 cycles at 
94^*0 for 30 sec and 68^*0 for 4 min; and a final 5 min incubation at 72''C. An aliquot 
of the reaction was separated using agarose gel electrophoresis to determine the 
approximate size of the PCR product and a single band of -^2.7 kb was detected as 
30 anticipated. The 2.7 kb PCR product was ligated into the plasmid pCR®2.1 using a 

TA Cloning® Kit (Invitrogen Corp., Carlsbad, California) according to the 
manufacturer's protocols. E. coli strain INVaF' (Invitrogen Corp., Carlsbad, CA) was 
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transformed with an aliquot of the Ugation reaction as recommended by the 
manufacturer and single bacterial colonies were isolated and grown overnight in LBM 
media containing 100 (xg/ml ampicillin. Plasmid DNA was isolated from these 
cultures using the Wizard Plus Miniprep Kit (Promega Corp., Madison, WI). 

5 

2. Generation of DNA encoding a E-cadherin/Ig fusion protein 

The extracellular region of E-cadherin is made up of five tandem 
repeats (domains) of approximately 110 amino acids each. In order to express an 
E-cadherin-human/human IgGl fusion protein, a DNA fragment containing domains 1 

10 through 5 of E-cadherin was generated by PGR amplification of the E-cadherin cDNA 

(pCR®2.1/E-cadherin #3 described above) with primers EcadS'Kozak (SEQ ID NO: 
14) and EcadS^Xho) (SEQ ID NO: 15). The 5^ primer EcadS'Kozak was used to add a 
5* HindJR site to facilitate subsequent subcloning of the 5 -domain fragment into the 
expression vector pDEF2 (see U.S. Patent 5,888,809) and reconstitute a Kozak 

15 sequence upstream of the translation initiation codon which was lacking from initial 

E-cadherin cDNA clone. The 3* primer Ecad3'(Xho) generated a new 3* end of the 
fragment containing domains 1 through 5 of E-cadherin, and added a,Xhol restriction 
site to the 3' terminus of the fragment to facilitate subsequent subcloning of the 
5 -domain fragment into pDEF2, 

20 

5'-GCGTTAAAGCTTCACAGCTCATCACCATGGGCCCTTGGAGCCGCA-3' 

SEQ ID NO: 14 

5'-AGGCGCTCGAGAATCCCCAGAATGGCAGGAATT-3' SEQ ID NO: 1 5 

25 The E-cadherin cDNA fragment contained in pCR2.1/E-cad#3 was 

amplified by PGR in a reaction containing 0.5 \il of pCR2.1/E-cad#3, 10\xl of 5X 
PGR reaction buffer, 1 p,l of 10 |iM primer Ecad5*Kozak, 1 |j.1 of 10 |iM primer 
E-cad3'(Xho), 1 \xl of Advantage™ KlenTaq polymerase mix, and 35.5 |li1 of H2O. 
Amplification conditions included an initial incubation for 1 min at 5 cycles at 

30 9rC for 30 sec and 72°C for 4 min; 5 cycles at 94**C for 30 sec and 70*^0 for 4 min; 25 

cycles at 94*^0 for 30 sec and 68**C for 4 min; and a final 5 min incubation at 72*'C. 
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An aliquot of the PCR reaction was resolved by agarose gel electrophoresis, and a 
single band of 2.1 kb was observed as expected. The fragment was purified using the 
Wizard PCR Purification Kit (Promega Corp., Madison, WI), and digested with Xhol 
and HindlU under standard conditions. The resulting fragment was referred to as 
5 5^HindIII-Kozak-E-cadherin-XhoI-3\ 

The plasmid pDCl/ICAM3.IgGl was digested with Xbal and SaK and 
a fragment of 908 bp (referred to as 5'-SalTIgGl-XbaI-3') with a 5' terminal SaE site 
and a 3' terminal Xbal site was purified ft"om a low melting temperature agarose gel 
(FMC BioProducts, Rockland, ME). This fragment contains the sequences encoding 

1 0 the CH2-CH3 region of human IgGl . The expression vector pDEF2 was linearized in 
the multiple cloning site with HindUl and Xbal and a three-way ligation reaction was 
performed which contained the 5 -Hindin-Kozak-E-cadherin-XhoI-3' fragment, 
linearized pDEF2, and the 5'-SalI-IgGl-XbaI-3' fragment. In this reaction, the 3' Xhol 
site in 5'-Hindin-Kozak-E-cadherin-XhoI-3'was joined in-frame to the 

15 5'-SalI-IgGl -XbaI-3'; both Xhol and Sail have compatible 5' overhangs which can be 

ligated together but cannot be re-digested with either Xhol or Sail, An aliquot of the 
hgation reaction was used to transform the bacterial strain XL-1 Blue (Stratagene, La 
JoUa, CA). Individual bacterial colonies were grown overnight in LBM containing 
100 |ig/ml ampicillin, and plasmid DNA was isolated with a Wizard Plus Miniprep 

20 Kit (Promega Corp., Madison, WI). The pDEF2/E-cadIgGl plasmid DNA was 

digested with HindJH and Xbal and the digestion products resolved by agarose gel 
electrophoresis. Those clones containing a 2.1 kb fragment were sequenced to ensure 
that the E-cadherin-IgGl chimera maintained an open reading frame across the 
E-cadherin/IgGl junction. 

25 The pDEF2/E-cadIgGl clone #3 was found to contain a continuous 

open reading frame across the E-cadherin/IgGl junction and was used for CHO cell 
expression studies described below. The open reading frame of the E-cadherin/IgGl 
fusion was not sequenced in its entirety since the DNA fragments contributing to this 
chimera had been previously sequenced and had not been subjected to PCR 

30 amplification. 
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C. MAdCAM l-1 

!• Isolation of a partial cDNA for human MAdCAM-1-1. 

A fragment containing a partial cDNA for MAdCAM-1 was isolated 
by PCR amplification of Marathon-Ready™ human spleen cDNA library with an 
5 Advantage™-GC cDNA PCR Kit (CLONTECH Laboratories, Inc., Palo Alto, CA), 

Polymerase chain reactions were performed in a Perkin Elmer Cetus (PE Applied 
Biosystems, Foster City, CA) DNA thermal cycler in a reaction containing 5 \il of 
Marathon™ human spleen cDNA, 1 jxl of 10 |iM primer MAdCAM-1 5W1 (SEQ ID 
NO:16), 1 [il of 10 \iM primer MAdCAM-1 y#5 (SEQ ID NO: 17), 10 ^il of 5.0 M 
1 0 GC-Melt™, 1 \il of SOX dNTP mix, 1 |il of Advantage™ KlenTaq polymerase mix, 

10 |ll1 of 5X reaction buffer, and 21 ^1 of H2O. 

5'-ATGGATTTCGGACTGGCCCTCCTGCT-3' SEQ ID NO: 1 6 

5'-CTCCAAGCCAGGCAGCCTCATCGT-3^ SEQ ID NO: 1 7 

15 

Amplification conditions included an initial incubation for 1 min at 94**C; 5 cycles at 
94X for 30 sec and 72°C for 3 min; 5 cycles at 94°C for 30 sec and 70°C for 3 min; 
25 cycles at 94°C for 30 sec and 68^C for 3 min; and a final incubation for 5 min at 
68^C. An aliquot of the reaction was resolved by agarose gel electrophoresis and a 

20 single fragment of 640 bp was detected. The fragment was subcloned into pCR®2.1 

and amplified in bacteria using the TA Cloning® Kit (Invitrogen Corp., Carlsbad, CA) 
following the manufacturer's protocol. Single bacterial colonies were grown 
overnight in LBM containing 100 |ig/ml ampicillin. Plasmid DNA was isolated from 
the cultures using a Wizard Plus Miniprep Kit (Promega Corp., Madison, WI), and the 

25 nucleotide sequence of the subcloned PCR product was determined by DNA sequence 

analysis. This partial cDNA for MAdCAM-1 begins with the initiation codon and 
terminates at its 3' end at residue 640 in domain 2. The sequence of this partial 
MAdCAM-1 cDNA is identical to that previously reported [Shyjan et al, J. frnmunol. 
156:2851-2857(1996)]. 

30 

2. Additional PCR amplification DNA encoding MAdCAM-1 domains 1 and 2 
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In order to express domains 1 and 2 of MAdCAM-1 as a secreted 
immunoglobulin fusion protein, it was essential to; (i) restore a Kozak sequence 
upstream of the initiation codon to allow for efficient protein translation; (ii) add a 5' 
Hindm site to facilitate subcloning of the fragment into pDEF2; (ii) extend the open 
5 reading frame of the existing partial MAdC AM- 1 cDNA to encompass additional 

amino acid residues needed to encode the entire second domain; and (iv) introduce a 
SaK site at the 3' terminus of the fragment to facilitate subcloning into pDEF2. These 
modifications were introduced into the MAdC AM- 1 fragment described above by 
PGR amplification using the primers Mad5 'Kozak (SEQ ID NO: 18) and Mad 3' #6 
1 0 Sal (SEQ ID NO: 1 9) as shown below. 



5'-GCGTTAAAGCTTCACAGCTCATCACCATGGATTTCGGACTGGCCCTCCT- 
3' 



15 



SEQ ID NO: 18 

GCTAGTCGACGGGGATGGCCTGGCGGTGGCTGAGCTCCAAGCAGGCAGCCTCATC 
GT 

SEQ ID NO: 19 

The PGR reaction included 0.5 \il of pGR®2.1/MAd#4-l template DNA, 10 ^1 of 5X 
20 PGR buffer, 10 \il of 5.0 M GG Melt™, i ^xl of SOX dNTP mix, 1 pi of 10 [iM, 1 nl 

of 10 fiM, 1 nl of SOX Advantage™ KlenTaq polymerase mix, and 25.5 \il ofKfi. 
The PGR amplification conditions included 94°G, for 1 min; 5 cycles at 94''G for 30 
sec and 72°G for 2 min; 5 cycles at 94''G for 30 sec and 70°G for 2 min; 20 cycles at 
94°G for 30 sec and 68°G for 2 min; and 68''G for 5 min. An aliquot of the reaction 
25 was resolved by agarose gel electrophoresis and a single fragment of - 0.7 kb was 

detected as expected. The PGR product was purified using the Wizard PGR 
Purification Kit (Promega Gorp., Madison, WI) and digested with Hindm and Sail 
under standard conditions. The fragment was Ugated into Hindm/Sall digested 
pBluescript® SK plasmid DNA (Stratagene, La JoUa, GA) under standard 
30 conditions, and the sequence of the MAdGAM-1 fragment in pBS-SK/Mad#7 was 

determined. 
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3. Generation of MAdCAM-l/Ig Fusion Protein 

To generate an expression vector encoding a chimeric 
domain l/domain2 MAdCAM-1-IgGl fusion protein, the 702 bp Hindlll-San 
fragment from pBS/Mad#7, the 908 bp Saa-Xbal fragment from pDCl/ICAM3.IgG 
and pDEF2 linearized by digestion with Hindm and Xbal were combined in a 
ligation reaction. An aliquot from the Hgation reaction was used to transform XL-1 
Blue bacteria (Stratagene, La JoUa, CA) and the plasmid DNA isolated from single 
colonies were screened by restriction digestion with Hindm, Xbal, and Sail. One 
clone, pDEF2/MadIg#l, was found to contain all three fragments and was used to 
generate stably transfected CHO cell lines as described below. 

Example 6 

Expression of MAdCAM-l/Ig and E-cadherin/Ig 
A. Generation of Stable CH O Cell T.ines Expressing MAdCAM-1/Tff and 
E-cadherin/Ig 

For transfection of host CHO DG44 cells with pDEF2/MAdIg or 
pDEF2/EcadIg, 50 to 100 ug of plasmid was linearized by digestion with the 
restriction enzyme Pvul. DG44 cells were cultured in DMEM/F-12 medium 
supplemented with hypoxanthine (0.01 mM final concentration) and thymidine 
(0.0016 mM final concentration), also referred to as "HT". DG44 cells were 
prepared for transfection by growing cultures to about 50% or less confluency in 
treated 150 cm^ tissue culture polystyrene flasks (Coming Inc., Coming, NY). Cells 
were collected and resuspended in 0.8 ml of a solution containing HeBS buffer (20 
mM Hepes, pH 7.0, 137 mM NaCl, 5 mM KCl, 0.7 mM, Na2HP04 and 6 mM 
dextrose) with the desired plasmid DNA. The resuspended cells were electroporated 
at room temperature with a capacitor discharge of 290 V and 960 jaF (9 to 1 1.5 msec 
pulse). Cells were added to 10 ml DMEM/F-12 supplemented with 5% dialyzed 
FBS and HT, pelleted by centrifugation, resuspended in 2 ml DMEM/F-12 
supplemented with 5% dialyzed FBS and HT ("non-selective media"), and seeded 
into 75 cm^ polystyrene tissue culture flasks. After two days growth the cells were 
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collected and seeded at varying dilutions in DMEM/F-12 supplemented with 5% 
dialyzed FBS and without HT ("selective media"). 

Once selection was complete and single cell clones could be 
identified, a single cell suspension of pooled CHO transfectants was prepared by 
typsinization. In order to isolate individual clones, the CHO/MAdCAM-lIg and 
CHO/E-cadIg transfectants were plated at a density of approximately 1 cell/well in 
Immulon-4 96-well plates (Dynex Technologies, Die, Chantilly, VA) under selective 
conditions. Once single colonies were detected in the 96-well plates, supernatant 
from each well was screened for the presence of MAdCAM-l/Ig or E-cadherin/Ig 
fusion protein. Single cell CHO clones producing a human IgGl protein component 
were expanded, and those clones producing the greatest level of MAdCAM-l/Ig or 
E-cadherin/Ig fusion protein were selected for large-scale protein production. 

In large-scale protein production, the CHO/MadIg and CHO/E-cadIg 
clones were expanded in serum-free 5.2 (HT") media in a spinner flask maintained at 
2>TC in an atmosphere of 5% CO2. When cell densities exceeded 10^ cells/ml, the 
media was harvested and the spinner flask was provided with fresh 5.2 (HT-) media. 
The spent media was first centrifuged to remove cell debris, filtered through a 0.22 
|im 1 liter filter unit (Coming Inc., Coming, NY), and stored at 4*^C. 

B. MAdCAM-l/Ig Pttrification 

MAdCAM-l/Ig was purified by affinity chromatography using a 
protein A-Sepharose® 4 Fast Flow resin column (Flow (Amersham Pharmacia 
Biotech, Inc., Piscataway, NJ) equilibrated with CMF-PBS. The cell supernatant was 
cycled through the column at a rate of 4 ml/min. After loading, the column was 
washed with CMF-PBS until there was no detectable protein present in the eluate. 
MAdCAM-l/Ig was eluted with 0.1 M acetic acid (pH 3,0) into a tube containing IM 
Tris, pH 9.0, and the sample was dialyzed at 4°C against CMF-PBS. 

C. Purification of E-cadherin/Ig 

E-cadherin/Ig was purified by affinity chromatography using a protein 
A-Sepharose® 4 Fast Flow resin column (Amersham Pharmacia Biotech, Inc., 
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Piscataway, NJ) equilibrated with D-PBS. The supernatant was cycled through the 
column at a rate of approximately 4 ml/min. After loading, the column was washed 
with Tris-buffered saline, pH 8.0, containing 1 mM CaCl2 until there was no 
detectable protein present in the eluate. E-cadherin/Ig was eluted with 0.1 M acetic 
5 acid (pH 3.0) containing 1 mM CaCl2 into a tube containing IM Tris, pH 9.0. 

Calcium concentration was adjusted to 1 mM and the sample was dialyzed at 4°C 
against Tris-buffered saline (pH 6.8) containing 1 mM CaCl2. 

Example 7 

1 0 Generation of JY/alpha-E Transfectants 

The human B lymphoblastoid cell line, JY, was transfected with the 
plasmid pMHneo/aE as described above. The transfected population was grown in 
"selection media" (containing RPMI 1640 media supplemented with 5% FBS, 100 
U/ml penicillin G, 100 |ig/ml streptomycin sulfate, 2 mM L-glutamine, 1 mM 

15 sodium pyruvate, and 1.0 mg/ml G418) and after 14 days, 10^ G418-resistant JY cells 

were resuspended in 5 ml of selection media containing 5 [ig/ml of the anti-aE 
monoclonal antibody Ber-ACT8 (DAKO Corp., Carpinteria, CA) and incubated on 
ice for 1 hour. Cells were collected by centrifugation, and the media was aspirated. 
The JY/aE transfectants were stained with selection media containing a 1 :200 

20 dilution of sheep anti-mouse Ig-FITC (Sigma Corp., St. Louis, MO) on ice for 1 

hour. Unbound antibody was removed by centrifiigation and the supernatant 
aspirated. Alpha E-expressing cells were isolated by flow cytometry and 
subsequently expanded by in vitro culture in selection media. 

Re-analysis of the sorted JY/aE^ population by flow cytometry with 

25 Ber-ACT8 revealed a bimodal population of cells that contained both alpha 

E-expressing and alpha E-nonexpressing cells. The bimodal population was stained 
a second time with Ber-ACT8 as previous described, and individual JY/aE cells were 
sorted into a 96-well Immulon-4 plate containing selection media. Single cell JY/aE^ 
clones expressing high levels of alpha E were expanded in vitro and JY/aE clone #47 

30 was selected for further characterization. This clone, but not the parental JY cells, 

displayed robust adhesion to recombinant E-cadherin/Ig and the binding was induced 
with phorbol ester treatment of the cells. Binding of JY/aE clone #47 to 
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E-cadherin/Ig was blocked by the anti-Py integrin antibody FIB504 (ATCC, 
Rockville, MD), as well as by antibodies to E-cadherin (Zymed Corp., So. San 
Francisco, CA). 

5 Example 8 

Isolation of a JY/alpha Clones 

To obtain a JY cell line that stably expresses the a^P2 integrin, JY 

cells were electroporated with pMHneo/aD as described above and stable 

transfectants were selected by growth in selection media. After a G418-resistant 

10 population of cells had been selected, JY/aD^ cells were stained with the anti-a^j 

monoclonal antibody 21 2D and sheep anti-mouse-FITC (Sigma Corp., St. Louis, 

MO). Single cell JY/aD^ clones were isolated by cell sorting using a flow cytometer 

as previously described for the isolation of single cell JY/aE^ clones. 

15 Example 9 

JY/aE^ Adhesion Assays 

A. Compound Dilutions 

Adhesion media (350 |j,1) (RPMI 1640 containing penicillin and streptomycin, 
L-glutamine, NaPy, and 5% FBS) was aliquotted into each well in rows A, C, E, G 

20 of a deepwell 96~well titer plate, 2.0 ml capacity (Beckman Listruments, Mc, 

FuUerton, CA) in columns 1-11. All compounds to be screened were dissolved in 
DMSO to a final concentration of 10 mM. Compounds were stored at -20''C, and 
thawed on the day of use in a incubator. Each compound (2,1 |j.1 ) was pipetted 
into a single well in columns 3-1 1, rows A, C, E, and G in the deepwell titer plate. 

25 To wells not containing compound (Al & A2, CI & C2, El & E2, Gl & G2), 2.1 ^l 

DMSO was added. An anti-P^ monoclonal antibody, FIB504 (ATCC, Rockville, 
MD), which blocks a^^j binding activity, was added to wells C2 and G2 at a 
concentration of 7.5 ng/ml. Each deepwell titer plate was covered to prevent 
dessication and stored in a 37°C incubator until ready for use. 

30 

B. Adhesion Assay 
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Adhesion assays were performed in 96-well Immunlon 4 plates 
(Dynex Technologies, Inc., Chantilly, VA) as follows. Each well was coated with 50 

E-cadherin/Ig (3.0 ng/ml) in D-PBS. Control wells were coated with capture 
antibody FrB504, to quantitate 100% input cell binding, or coating buffer alone to 
determine background binding. Following an overnight incubation at 4'*C, the plates 
were washed three times with 200 ^I/well D-PBS and blocked with 1% BSA in 
D-PBS for at least 1 hour. The BSA solution was removed and 100 \i[ of adhesion 
media (RPMI 1640 containing penicillin and stretomycin, L-glutamine, sodium 
pyruvate, 0.1% BSA, and 60 ng/ml PMA), was added to rows B through G, columns 
1 through 1 1 . 

At this point, 100 jil adhesion media containing a test compound at a 
concentration of 60 [iM, was transferred from the deepwell 96-well titer plate, in 
tripUcate, to the E-cadherin/Ig coated adhesion plate. The outer rows were filled with 
300 |al of D-PBS. These plates were transferred to a humidified 37°C incubator with 
an atmosphere of 5% COj. 

The adhesion assay was initiated by addition of 100 \il of the JY/aE"^ 
cell suspension to each well of the E-cadherin-coated plate. The final volume in each 
well was 300 iil adhesion media containing 10^ cells, PMA (final concentration 20 
ng/ml), and the test compound (final concentration 20 \iM). The plates were 
incubated at 37°C for 30 min. Each compound was tested in triplicate. 

Adherent cells were fixed by the addition of 50 \i\ of a 14% 
glutaraldehyde solution in D-PBS. Plates were washed with water, stained with 100 
til/well 0.5% crystal violet (Sigma Corp., St. Louis, MO) solution for 5 min. Three 
hundred microliters/well of 70% ethanol was added, and adherent cells were 
quantitated by determining absorbance at 570 ran. Percentage of cell binding was 
determined by using the mean values for each triplicate in a given assay in the 
following formula. 

% binding = r A570 (binding to E-cadhe rin/lg^ - A570 (binding to BSA) x 100 
A570 (binding in adhesion media without compound) 

C. IC50 Determinations 
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During the initial screening of test compounds, each chemical entity 
was tested in cell-based adhesion assays at a fixed concentration of 20 fiM. Those 
compounds that blocked JY/aEPy-dependent adhesion to E-cadherin/Ig by 50% or 
more were subsequently retested at multiple concentrations to determine the 
5 inhibitory concentration at which cell binding is reduced by 50%, Le. , the IC50 value. 

Of the compounds screened, 40, or 1.4 % of the total hbrary, inhibited 
c^EPv-E-cadherin adhesion by 40% or greater. Approximately 18 of the compounds 
were identified in the diarylamide Kbrary, and 22 compounds were identified in the 

10 diaryl sulfide library. Upon re-analysis of these primary hits in IC50 determinations, 4 

of the 40 compounds were shown to inhibit JY/aE+ binding to E-cadherin with an 
IC50 value of not more than 10 |aM. Many of the initial hits were eliminated from 
further characterization if their initial inhibitory activity was not reproducible; or a 
compound was shown to inhibit multiple integrin-dependent adhesive events; or the 

15 IC50 value exceeded 10 |xM, or it displayed any cytopathic or cytotoxic effects. The 

following compounds displayed reproducible inhibitory activity at compound 
concentrations below 10 jxM: Cmpd K, Cmpd W, Cmpd Z, Cmpd D as set out in 
Table 2. There were several compounds that displayed significant inhibitory activity 
in the initial screen that failed to inhibit JY/aE+/E-cadherin binding upon re-analysis. 

20 It is possible that the activity of some diaryl compounds was lost upon repeated 

freezing and thawing. 

To assess the selectivity of each compound, an IC50 value was 
determined for additional binding partner compounds JY/avp3 and vitronectin, 
JY/a^pj and VCAM/Ig„ JYIa^i and VCAM, JY/a^Pa and ICAM-1, JY/aJ^^ and 

25 iC3b, and JYla^^^ and MAdCAM- 1 . 

For each IC50 assay, 50 |il of the ligand diluted in 50 mM bicarbonate 
buffer (pH 9.6) was dispensed per well of an Inimulon-4 plate. A single plate was 
used to test two different ligands, each in triplicate. The coating concentration for the 
various Hgands was as follows: VCAM-l/Ig at 2.0 |ig/ml; ICAM-l/Ig at 5.0 txg/ml; 

30 vitronectin at 0.5 lag/ml; MAdCAM-l/Ig at 3.0 |ig/ml, and iC3b at 5.0 ^g/mL The 

capture antibody, e,g, anti-CD 18 monoclonal antibody TSl/22, was added at a 
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concentration of 10 |ig/ml in 50 |il/well. Ligand-coated plates were covered and 
stored overnight at 4''C. The following day, the contents of each well was decanted, 
and each plate was washed three times with 200 nl/well D-PBS. The plate was then 
blocked by the addition of 300 |il/well of 1% BSA/D-PBS solution. Each plate was 
again covered and incubated at room temperature for at least 1 hour. 

For each IC50 determination, the test compound was serially diluted 
in DMSO to enable testing at final concentrations of 40 |iM, 20 [iM, 10 |i.M, 5.0 \iM, 
2.5 ^M, 1.25 ^iM, 0.63 \iM, 0.32 nM and 0.16 [iM. Prior to transfer to the adhesion 
plate, the compounds were initially diluted by transferring 4.2 [il of the diluted 
compounds to a 96-well deepwell titer plate containing 0.7 ml/well of RPMI 1640, 
0.1% BSA, and 3 ng/ml PMA (Sigma Corp., St. Louis, MO) pre-warmed to 37°C. 
The 1% BSA/D-PBS blocking solution was decanted from the 96-well Immulon-4 
plates and replaced with 0.2 ml of diluted compound. For each 96-well plate to be 
screened, approximately 8x10^ cells were collected by centrifligation and 
resuspended in adhesion media (RPMI 1640 containing 0.1% BSA) to a final 
concentration of lOVml. To prevent PMA-dependent homotypic aggregation in the 
adhesion assay, the anti-CD18 antibody 22F12C (ICOS Corp., Bothell, WA) was 
added to the cell suspensions to a final concentration of 10 [xg/ml, and the cells were 
incubated at 37°C for 15 min. This antibody was not added to CD18-dependent 
adhesion assays involving JY/cClPz, JY/a^p^ or JY/aMP2 and their corresponding 
ligands ICAM-1, VCAM-1, or iC3b. 

The adhesion assay was initiated by addition of 100 jxl of the cell 
suspension to each well of the hnmulon-4 plate. The plates were incubated at 37°C 
for 30 min and adherent cells were fixed for least 1 hour by the addition of 50 [al of a 
14% glutaraldehyde solution in D-PBS. The plates were washed with water and 
stained with 100 jil/well 0.5% crystal violet (Sigma Corp., St. Louis, MO) solution 
for 5 min. The plates were washed a second time with water to remove excess 
crystal violet dye, and 300 [il 70% ethanol was added to each well. Adherent cells 
were quantitated by determining the absorbance at 570 nm in a plate 
spectrophotometer. The percentage of cell binding was determined by using the 
mean values for each triplicate in a given assay and the formula below. 
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% Binding = A570 (binding to lieand) - A570 rbinding to BSA) x 100 

A570 (binding in adhesion media without compound) 

The four compounds Cmpd K, Cmpd W, Cmpd Z, Cmpd D identified 
5 in the primary screen were selected for further specificity profiling, whereby their 

IC50 values were determined in additional integrin-dependent adhesive events. In all 
cases, the indicator cell line used in the binding assay was treated with 2 ng/ml PMA 
during the course of the assay to stimulate integrin-dependent adhesion. The IC50 
values of these four compounds were determined in adhesion assays as indicated in 
10 Tables, 



TABLES 
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Compound 


E-cadher 


MAdCAM 


iC3b 


VN 


ICAM-1 


VCAM 


VCAM 






-1 a^^j 


«Mp2 




«lP2 


«4pi 


«dp2 


CmpdK 


3 \lM 


4|xM 


4jiM 


6jiM 


11 [iM 


>40 \lM 


>40 [iM 


CmpdD 


3 \iM 


4|iM 


7^M 


4^M 


28 nM 


>40\lM 


>40 nM 


CmpdW 


5 [iM 


5 [iM 


4jiM 


8^M 


30 nM 


>40 nM 


>40 |JiM 


CmpdZ 


3 |iM 


11 |iM 


ND 


ND 


20|a,M 


>40 |J.M 


>40 nM 



Example 10 
Cloning, Expression and Purification 
of Alpha 1, Alpha 2 and Alpha 111 Domains 

The collagen-binding integrins alpha 1, alpha 2 and alpha 1 1 contain I 
domain sequences homologous with the I domain sequences contained in the 
leukointegrins alpha L, alpha M, alpha X and alpha d. To investigate the possibility 
that these molecules might be susceptible to modulation through an allosteric 
regulatory site, the library of test compounds was assessed for the ability to inhibit 
interactions between these integrins and their ligands collagen and laminin. 

The alpha 1 and alpha 2 I domain sequences and alpha 1 1 were cloned 
into the bacterial expression vector pET15b (Novagen). Expression of these 
constructs in E. coli results in proteins with an amino terminal histidine tag and the 
"tagged" protein which can be purified using a nickel column. The cloning of the 



111 



alpha 1 1 was carried out as previously described [Veiling, et al, J. Biol. Chem, 
274:25735-25742 (1999)]. 

Both alpha 1 and alpha 2 I domain sequences were cloned into 



pETlSb following PGR amplification to add restriction sites that permit the I 
5 domains to be cloned in frame with the histidine tag in the vector. The template for 

the alpha 1 I domain PGR reaction was a full-length alpha 1 cDNA cloned by 
hybridization fi-om a spleen cDNA library in vector pcDNA-1 Amp as previously 
described. The hybridization probe used for this screen was the product of the PGR 
reaction using the following Alphal.5 (SEQ ID NO: 20) and Alphal.3 (SEQ ID NO: 
10 21) primers, respectively: 



1 5 The samples were initially incubated at 94°G for 30 sec followed by 5 cycles of 94^G 

for 5 sec and 72^G for 2 min; 5 cycles of 94*^G for 5 sec and 70*^G for 2 min; 25 cycles 
of 94^C for 5 sec and 68°G for 2 min; and a final incubation of 72^C for 7 min. The 
PGR products were cloned into the TOPO TA vector pGRII (Invitrogen) and 
sequenced. The resulting clone was used as a template in PGR using the same 

20 conditions as above and the amplification product was gel purified, labeled with ^^P 

using a random primed labeling kit (Boehringer Mannheim), and used as a 
hybridization probe. Hybridization was performed using ExpressHyb hybridization 
solution (Glontech) under the same conditions used in the screening for full length 
alpha 1 1 cDNA The resulting clone, alphal/pcdna/1 1 1 was used as a template to 

25 subclone the alpha 1 I domain. 



The alphal I domain was amplified by PGR using AL5Nde (SEQ ID 
NO: 22) and A1.3Bam (SEQ ID NO: 23) primers, respectively shown below 



5'-i 



GAGTTTGAGGGGGGGGGTGGAAGAGATG-3' 
GGAGTTGAGTGCTGGATTCTTGTAGAGG-3' 



SEQ ID NO: 20 
SEQ ID NO: 21 



30 



5'-ATATCATATGGAGATAGTCATAGTGCTGG-3' 
5'-ATATGGATGGCTAAGACATTTCCATTTGAAATG-3' 



SEQ ID NO: 22 
SEQ ID NO: 23 
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The alpha 2 I domain was cloned by PGR using a HUVEC cDNA Hbrary in the 
vector pcDNA-1 Amp as template and A2.5Nde (SEQ ID NO: 24) and A2.3Bam 
(SEQ ID NO: 25) primers, respectively shown below: 

5'-ATATCATATGGATGTTGTGGTTGTGTGTG-3' SEQ ID NO* 24 

5'-ATATGGATCCCTATGACATTTCCATCTGAAAG-3' SEQ ID NO: 25 

PGR conditions for amplification of both I domains included an initial incubation at 
94°G for 2 min followed by 30 cycles of 94°G for 20 sec; 55^G for 30 sec and ITC 
for 45 sec; and a final incubation at 72^G for 7 min. The PGR products were gel 
purified, digested with me\ and Bamm, gel purified again, and cloned into pET15b 
previously digested with same enzymes. The resulting clones alphal/pet/2 and 
alpha2/pet/27 were sequenced 

The alpha 1, alpha 2 and alpha 1 1 pET15b clones were transformed 
into the bacterial strain BL21(DE3)pLysS (Stratagene) for expression. Histidine- 
tagged proteins were isolated firom the soluble fi-action of the E. coli lysate using a Ni 
-NTA agarose column (QIAGEN) and elution with an imidazole gradient. The eluted 
proteins were dialyzed against CMF-PBS and biotinylated using EZ-Link 
Sulfo-NHS-LG-Biotin (Pierce) according to the manufacturer's suggested protocol. 

An assay for measuring alpha 1 or alpha 2 I domain binding to 
collagen in a 96-well plate format involves binding collagen to the wells of a 96-well 
plate, adding biotinylated alpha 1 or alpha 2 protein to the wells and measuring the 
amount of collagen bound I domain using europium-coupled streptavidin and time 
resolved fluorescence. Immulon4 96-well plates were coated with 20 M.l/ml of rat 
type I collagen (Sigma) in CMF-PBS overnight at 4^G. Wells were washed with 250 
lil of GMF-PBS two times and blocked with 2.5% BSA in GMF-PBS at 30**C for 1 
hr. The wells were washed with 200 [il of GMF-PBS and biotinylated protein was 
added to the wells at 1 jig/ml in either GMF-PBS with 2 mM MgGl2 and 1% BSA or 
in TBS with 2 mM MnGl2 and 1% BSA and incubated at 37^G for 3 hours. The 
wells were washed with 200 ^1 of the same incubation buffers (without I domain 
protein) two times and collagen bound biotinylated protein was detected with the 
addition of 100 |j,1 of a 1 :1000 dilution of streptavidin europium (SA-Eu; Wallac) in 
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SA-Eu dilution buffer (Wallac). Incubation was for 1 hour at room temperature. The 
wells were washed with 200 \il of incubation buffer six times and 100 |al of 
Enhancement solution (Wallac; diluted 1:1 with water) was added to each well for 5 
minutes at room temperature. Fluorescence was measured using the Eugen program. 

Example 11 

Identification of Alpha2 Antagonists of Collagen Binding 

FACS analysis has indicated that Jurkat cells express both alphal and 
alpha2 integrins. Binding studies using monoclonal antibodies to each of these 
integrins has shown that Jurkat cell adhesion to rat type I collagen is mediated 
predominantly through interaction with alpha2. For example, an alpha2 blocking 
monoclonal antibody has been shown to completely inhibit Jurkat cell binding to 
type I collagen. In view of this result, Jurkat cells were employed in an adhesion 
assay as described below to identify inhibitors of alpha2 binding. The assay was 
carried out using a modification of a procedure previously described [Sadhu, et al, 
supra] 

hnmulon 4 plates (Dynex Technologies, Chantilly, VA) were coated 
overnight at 4°C with (i) 50 jil rat type I collagen (Sigma) (20 [ig/ml in CMF-PBS), 
(ii) anti-betal monoclonal antibody 3 S3 (5|ig/ml) in bicarbonate buffer, pH 9.6, (iii) 
or bicarbonate buffer alone. Plates were washed once with 200 |il/well D-PBS and 
blocked with 1% BSA (100 |iywell) in D-PBS for 1 hr at room temperature. Wells 
were rinsed once with 100 (il adhesion buffer containing RPMI and 1% inactivated 
FBS and lOOp-l adhesion buffer containing PMA (10 ng/ml final concentration) was 
added to each well. Adhesion buffer (100 |il) with or without candidate inhibitor (at 
a final concentration of 20 ^M) was added to each well, followed by addition of 100 
\\\ Jurkat cells (1x10^ cells/ml) in adhesion buffer, and incubation carried out at 
3TC for 30 min. Adherent cells were fixed by additional of 50 fil/well 14% 
glutaraldehyde in D-PBS and incubation at room temperature for 2 hr. The plates 
were washed with dHp and stained with 50 |al/well 0.5% crystal violet in 10% 
ethanol for 5 min at room temperature. The plates were washed in several changes of 
dH20, after which 70% ethanol was added. Adherent cells were quantitated by 
determining absorbance at 570 nm and 410 nm using a SPECTRmax 250 microplate 
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spectrophotometer system (Molecular Devices, Sunnyvale CA). The percentage of 
cell binding was determined using the formula below. 

% Binding - A57Q - A41Q(binding to collagen^ X 100 

A570 - A410 (binding to mAb 3S3) 

Data was normalized using the formula: 

% of DMSO binding = % of cell binding, inhibitors X 100 

% of cell binding, DMSO 

One hundred twenty-one compounds inhibited Jurkat adhesion to rat 
type I collagen at a level of 50% or greater than the control. IC50 determinations for 
these inhibitors were assessed in Jurkat adhesion assays as described above except 
that inhibitors were tested at two-fold dilutions through the concentration range of 
0.15 to 40 iiM, The IC50 values of 113 of the 121 compounds were determined and 
21 of these 121 were selected based on potency in the IC50 range of 2 to 17 |j.M in 
the assay and for specificity in showing low level inhibition in the a^^j binding assay 
(described herein) and the von Willebrand factor binding assay (described herein). 
These 21 compounds were further analyzed for specificity and toxicity. In specificity 
determinations, compounds were tested in a concentration range of 0.15 |iM to 20 
[iM for the ability to inhibit Jurkat cell binding to immobilized VCAM-l/Ig. The 
assay was carried out in a manner similar to the collagen adhesion assay described 
above except that cells were coated with VCAM-l/Ig instead of collagen. Binding to 
VCAM-1 was dependent on surface expression ofa^^^. These results are shown in 
Table 6. 

For 21 compounds, toxicity of Jurkat cells was assessed following a 
four hr or 24 hr incubation. LD50 concentrations were determined using a CellTiter 
96® Aqueous One Solution Cell Proliferation Assay System (Promega) according to 
the manufacturer's suggested protocol A two-fold serial dilution series of each 
compound was tested in a concentration range of 40 |xM to 0.15 |j,M. Results fi-om 
the toxicity assay are shovra in Table 6. 



115 



TABLE 6 



COMPOUND 


a2Pi/C0LLAGEN 
EC50 (nM) 


KyPl/VCAM 

EC50 (|iM) 


TOXICITY 
LD50 (nM) 


Cmpd AD 


2 


>20 


3 


CmpdT 


4 


15 


25 


Cmpd AF 


6 


>20 


35 


CmpdAI 


6 


>20 


33 


Cmpd AG 


7 


>20 


35 


Cmpd AE 


7 


>20 


30 


Cmpd Y 


7 


>20 


25 


Cmpd J 


8 


>20 


>40 


CpmdX 


8 


>20 


30 


CmpdM 


8 


>20 


25 


Cmpd AL 


8 


>20 


20 


Cmpd AJ 


8 


>20 


38 


CmpdAK 


9 


>20 


35 


Cmpd AH 


9 


>20 


38 


Cmpd AB 


13 


>20 


>40 


Cmpd A 


14 


17 


>40 


CmpdU 


15 


>20 


>40 


CmpdG 


16 


>20 


>40 


Cmpd E 


16 


>20 


>40 


CmpdB 


17 


>20 


>40 


Cmpd AN 


17 


>20 


>40 



Example 12 

Identification of Alphal Antagonists of Collagen Binding 

Chinese hamster ovary (CHO) cells do not express endogenous 
collagen receptors. Accordingly, CHO cells were transfected with a full-length 
alphal expression construct, alphal/pDC-1/1. The full-length alphal insert was 
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removed from a clone in the vector pLEN [Briesewitz et al, JBC 268:2989-2996 
(1993)] and subcloned into the pDC-1 to generate the clone alphal/pDC-1/1. 
Transfectants were grown in selective media (DMEM/F12 with 10% FBS) and 
cloned by limiting dilution. Alphal expressing clones were identified by staining the 
5 cells with a blocking alphal monoclonal antibody (antibody 5E8D9; Upstate 

Biotech) and determining expression levels by FACS analysis. These cells were 
demonstrated to adhere to type IV collagen in an alphal -dependent manner using the 
blocking alphal monoclonal antibody (Upstate Biotech; clone 5E8D9) which was 
shown to inhibit this adhesion. In view of this result, the alphal transfected CHO 

10 cells were used in an adhesion assay as described below to identify inhibitors of 

alphal binding. This assay is a modification of the procedure used to identify alpha2 
antagonists described above. 

Immulon 4 plates were coated overnight at with either (i) 50 \\\ 
per well human type IV collagen (Sigma) (0.5 (Ag/ml in CMF-PBS), (ii) the 

15 anti-alphal monoclonal antibody 5E8D9 in bicarbonate buffer, pH 9.6, or (iii) 

bicarbonate buffer alone. Plates were washed twice with D-PBS and blocked with 
1% BSA (100 )Lil/well) in D-PBS for 1 hour at room temperature. Wells were rinsed 
once with 100 |al/well adhesion buffer (DMEM/F12 media with no serum). 
Adhesion buffer (200 ^1) with or without candidate inhibitor was added to each well 

20 followed by the addition of 100 |j.1 of alphal transfected CHO cells in adhesion 

buffer. CHO cells were previously recovered using versene and rinsed 3 times in 
DMEM/F12 media containing 10% FBS. Cells were resuspended in adhesion buffer 
at a density of 0.75 x 10^ cells/ml hicubation of the alphal -transfected CHO cells 
on type IV collagen was carried out at 37°C for 30 minutes. Adherent cells were 

25 fixed by additional 50 |il/well 14% glutaraldehyde in D-PBS and incubation at room 

temperature for 2 hours. The plates were washed with dH20 and stained with 50 
|il/well 0.5% crystal violet in 10% ethanol for 5 minutes at room temperature. The 
plates were washed in several changes of dH20 after which 70% ethanol was added. 
Adherent cells were quantitated by determining absorbance at 570 nm and 410 nm 

30 using a SPECTRmax 250 microplate spectrophotometer system (Molecular Devices, 
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Sunnyvale CA). The percentage of cell binding was determined using the formula 
below. 

% binding - (A57Q-A410(binding to collagen) X 1 00 

5 (A570-A410(binding to mAb 5E8D9) 

Data was normalized using the formula: 

% of DMSO binding = % of cell binding, inhibitors X 1 00 
1 0 % cell binding, DMSO 

Sixty- four compounds inhibited alphal-transfected CHO cell 
adhesion to type IV collagen by a level of 50% or greater than the DMSO-control. 
EC50 determinations for these compounds were determined in alphal-transfected 

15 CHO cell adhesion assays as described above, except that the inhibitors were tested 

at two-fold dilutions through the concentration range of 0.15 jiiM to 20 ]xM {i.e., 0.15 
HM, 0.3125 ixM, 0.625 ^iM, 1.25 |xM, 2.50 iiM, 5 i^M, 10 jiM, 20 |iM). The EC50 
values for these compounds ranged from 0.5 jiM to 1 8 {xM. These compounds were 
further analyzed for selectivity and toxicity. 

20 For initial specificity testing, the compounds were tested in a 

concentration range of 0.15 jiM to 20 )xM for the ability to inhibit alpha2-transfected 
CHO cell adhesion to type I collagen. For this assay CHO cells were transfected 
with an alpha2 expression construct, alpha2/pDC-l/8. The original alpha2 construct 
was in the expression vector pcDNA-3 and was a Genestorm clone purchased from 

25 hivitrogen. The alpha2 sequence was subcloned into pDC-1 resulting in the clone 

alpha2/pDC-l/8. Alpha2-expressing cells were cloned and analyzed by FACS using 
an alpha2 monoclonal antibody, A2-IIE10 (Upstate Biotech). A CHO cell line 
expressing moderate levels of alpha2 was identified and used in adhesion assays as 
described above for alphal . The only differences in the alpha2 adhesion assay 

30 included (i) using immobilized rat type I collagen (Sigma) in place of the type IV 

collagen and (ii) using the alpha2 monoclonal antibody, A2-IIE10, in place of the 
alphal monoclonal antibody. Most compoxmds had a narrow range of specificity for 
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alpha 1 compared with alphaZ. These compounds were about 1-3 fold more potent in 
inhibiting alphal dependent adhesion than for inhibiting alpha2 dependent adhesion. 

The toxicity of the compounds was assessed in a 4 hour assay using 
the alphal -transfected CHO cells. LD50 concentrations, (or "Lethal Dose 50"), as 
5 used herein, is the compound concentration necessary to kill 50% of the cells over a 

defined time interval. LD50 concentrations were determined using a CellTiter 96 
Aqueous One Solution Cell Proliferation Assay System (Promega) according to the 
manufacturer's suggested protocol. A two-fold serial dilution series of each 
compound was tested in a concentration range of 40 |iM to 0.15 jiiM. Toxicities for 

10 these compounds ranged fi*om 2.5 |iM to 40 jj,M. 

Thirty-two compounds which were chosen based on their potency, 
selectivity and toxicity profiles were further analyzed for specificity. Compounds 
were tested for inhibiting adhesion of more distantly related I domain-containing 
integrins alphaL (LFA-1) and alphaM (Mac-1). For alphaL, the compounds were 

1 5 tested for inhibition of JY8 cell adhesion to ICAM-1 . 



ICAM-l/JY-8 Cell Adhesion Assay 

Biologically relevant activity of the compounds in the present 

invention was confirmed using a cell-based adhesion assay that measures the ability 
20 of the compoimds to block adherence of JY-8 cells (a human EBV-transformed B 

cell lino expressing LFA-1 on its surface) to immobilized ICAM-1, as follows. 

Compounds were screened for the inhibition of LFA-1 dependent adhesion, as 

described with respect to the alphal assay, with some modifications. Plates were 

coated with ICAM-1 Ig protein (5 (xg/ml in sodium bicarbonate buffer solution) 
25 instead of type IV collagen. JY cells were used in place of K562 [aj cells. The 

capture monoclonal antibody used was 22F12C (at 5 |ag/ml in sodium bicarbonate 

buffer solution) in place of an alphal monoclonal antibody. 

For alphaM, the compoimds were tested for inhibition of Mac-1 

transfected JY cell adhesion to iC3b (assay described in Example 2). For both 
30 assays, compounds were tested in a 2-fold dilution series in a concentration range 

from 20 ju.M to 0.15 |iM. Most compounds were 1-10 fold more effective at 
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inhibiting alphal dependent adhesion than inhibiting LF A- 1 and MAC- 1 dependent 
adhesion. These compounds were also analyzed for toxicity in a 4 hour assay with 
the JY cells as described above for the CHO cells. 

A second alphal -dependent cell adhesion assay was developed to 
further assess the alphal antagonists identified. K562 cells, a myeloid leukemia cell 
line, was transfected with a full-length alphal expression construct 
alphal/pMHneo/40. The alpha l/pMHneo/40 construct was generated by subcloning 
the full length alphal sequence into the expression vector pMH-neo [Hahn et al. 
Gene 127:267-268 (1993)]. Transfectants were selected with 0.5 mg/ml G418. In 
order to further select for alphal -expressing cells, the transfectants were panned for 
adhesion to type IV collagen. For the panning, tissue culture plates were coated with 
20 mg/ml of human type IV collagen (Sigma) in CMF-PBS for 1 hour at 37°C. The 
plates were washed with bmding buffer (RPMI with 10% FBS) and the alphal - 
transfected K562 cells were added in binding buffer containing 20 ng/ml PMA. 
After incubation at 37°C for 1 hour, the plates were washed to remove unbound cells. 
Adherent cells were removed with versene and diluted with binding buffer. After 
panning, K562 cell lines expressing alphal were obtained and used for further 
screening described below. 

Twenty-one alphal antagonists identified in the CHO cell adhesion 
assay were further analyzed in an alphal -transfected K562 cell adhesion assay. The 
cell adhesion assay was performed as described above for the CHO cell assay except 
that RPMI was used as the adhesion buffer. The potencies of the alphal antagonists 
were similar in the K562 and CHO cell adhesion assays with most EC50 values for 
most compounds falling within a 1-3 fold range between the two assays. The 
compounds were also analyzed for toxicity with the K562 cells in a 4 hour assay 
using the CellTiter 96 Aqueous One Solution Cell Proliferation Assay System 
described above. The toxicities (LD50) of most compounds was similar in the K562 
and the CHO assays. The LD50 values between the two assays varied by less than 2- 
fold for the majority of compounds. 

The structures of five alphal antagonists are shown in Table 7. These 
compounds have EC50 values in the range of 0.5 - 1 .S^iM. These compounds have 
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narrow specificity for alphal over alpha2 (1-4 fold), and greater selectivity over 
more distantly related integrins, such as LFA-1 and Mac-1 (3-10 fold). The 
window between potency (EC50) and toxicity (LD50) ranges from 6 - 20 fold. 
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TABLE 7 
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Example 13 

Expression and Purification of Alphal I domain 
and its Usage in a Biochemical Assay 

An alphal I domain construct was generated for expressing the alphal 
I domain as a histidine tagged protein in E. coli. The histidine-tagged protein was 
used in co-crystallization experiments to determine the 3-dimensional structure of the 
alphal I domain complexed with inhibitors. The histidine tagged protein was also 
used to assess alphal antagonists in a biochemical assay by measuring the binding of 
the alphal I domain to immobilized collagen. 

The alphal I domain was cloned as follows. A polynucleotide 
encoding the alphal I domain was PGR amplified using the Al J.Bam (SEQ ID NO: 
26) and Al.LPst (SEQ ID NO: 27) primers shown below and the vector 
alphal/pDC-1/1 as template. 

Al .LBam : CGGATCCCCCACATTTCAAGTCGTGAAT SEQ ID NO: 26 

Al .LPst : GCTGCAGTCATATTCTTTCTCCCAGAGTTTT SEQ ID NO: 27 

PGR conditions included an initial incubation at 94°C for 2 minutes, followed by 30 
cycles of 94^C for 30 seconds; 55°C for 30 seconds and 72°C for 30 seconds; and a 
final incubation of 72°C for 7 minutes. The resulting PGR product was gel purified, 
digested with BamHI and PstI, gel purified again and then cloned into the vector 
pQE30 (Qiagen) previously digested with BamHI and Pstl The resulting clone 
alphal/pQE30/2 was verified by sequencing. 

The alphal/pQE30/2 construct was transformed into E. coli strain 
M15(pREP4) (Qiagen) for protein expression. Histidine-tagged alphal I domain was 
solubilized from purified inclusion bodies using 6 M guanidine and then snap 
refolded by dilution in buffer without guanidine. The solubilized alphal I domain 
was purified using a Ni-NTA agarose column (Qiagen) and elution with an imidazole 
gradient. 

The purified alphal I domain was used in direct binding assays with 
immobilized type IV collagen as follows. Gostar Immulon4 plates (96 well) were 
coated overnight with either (i) 50 [il/well of human collagen IV protein (Sigma) at 
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40 \ig/m\ in CMF-PBS, (ii) anti-alpha 1 I-domain monoclonal antibody (Immune 
Diagnostics) at 10 |ig/ml in CMF-PBS (positive control), or (iii) CMF-PBS alone 
(negative control). Plates were incubated overnight at 4°C. The next day, media was 
removed and the plates were blotted dry, after which 150 |xl/well of 2% BSA in 
CMF-PBS containing 0.05% Tween-20 was added to block the plates, and plates 
were incubated further at 37°C for 1 hour. Media was again removed from the plates 
which were blotted dry, and then washed twice with 150 |al/well of CMF-PBS 
containing 0.05% Tween-20 and 5 mM MgCl2 (PBS/T/Mg). Approximately 50 
M,l/well of PBS/T/Mg containing 2X compound, DMSO, anti-alpha 1 I-domain 
monoclonal antibody, isotype-matched control monoclonal antibody or no inhibitor 
was added to the plates, after which 50 |il/well of PBS/T/Mg containing 20 fig/ml of 
purified alpha 1 I-domain was added and plates were incubated for 30 minutes at 
37^C. Media was removed and the plates were blotted dry, then washed twice with 
100 ^iVwell PBS/T/Mg, after which 100 ^xl/well PBS/T/Mg containing 1 |ig/ml 
anti-penta-His monoclonal antibody (Qiagen) was added. The plates were then 
incubated for 30 minutes at 37'^C, the media was removed, and the plates blotted dry. 
The plates were then washed twice with 100 nl/well PBS/T/Mg, 100 ^ll/well 
PBS/T/Mg containing a 1 :20,000 dilution of GAM-HRP (Sigma) was added, and the 
plates were incubated for 30 minutes at 37°C. Media was removed, and the plates 
were blotted dry. The plates were washed twice with 100 jal/well PBS/T/Mg and 100 
^ll/well of Substrate Buffer containing 150 [iVl of Hp^ and a 1:100 dilution of TMB 
substrate stock was added to each well and the plates were developed in the dark for 
30 minutes at room temperature. Fifty |il/well of 15% H2SO4 was then added to stop 
the reaction and the plates were analyzed by A450 -A^^o on a spectrophotometer. The 
specific signal was determined by subtracting background binding to "negative 
control" wells. 

An additional assay was developed using Europium-labeled alpha 1 I 
domain and bound I domain was directly detected after washing using time resolved 
fluorescence (TRF). In this assay, purified alpha-1 1 domain was labeled with 
Europium using a DELFIA Europium-labeling kit according to the manufacturer's 
suggested protocol (Wallac). Costar Immulon4 plates (96 well) were coated with 
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100 M,l of 25 |ig/ml of human collagen IV protein (Sigma) in CMF-PBS/1 MgCl^, 
and incubated overnight at 4°C. Plates were then washed 3 times with TBS/T (20 
mM Tris, pH 8.0; 150 mM NaCl; 0.02% Tween-20) and ImM MgCl^ (TBS/T/Mg), 
200 \i] per well Plates were then blocked with CMF-PBS/lmM MgCl2/2% BSA, 
100^1 per well for 1 hour at 37°C. Plates were washed again, then probed with 5 
g/ml Europium I domain in RPMI/5%TBS/lmM MgCl2, lOOjil per well, and 
incubated at ST^'C for one hour. Plates were then washed again and developed by 
adding 100 [iV well Enhance (Wallac) and analyzed on a Victor plate reader by time 
resolved fluorescence. 

Example 14 

Expression and Purification of AlphalBetal Leucine Zipper Protein 
and its Usage in a Biochemical Assay 

In order to develop a more physiologically accurate biochemical 
assay, an alphalbetal leucine zipper protein was generated. Expression constructs 
were prepared individually encoding the full length extracellular domains of alphal 
and betal without the transmembrane and cytoplasmic tail polypeptide sequences. 
Removal of the transmembrane regions allows these proteins to be secreted jfrom 
transfected cells providing easy purification. The transmembrane sequences were 
replaced with the acidic and basic leucine zipper sequences respectively. See 
generally, Chang et al, PNAS 91 :1 1408-1 1412 (1994). 

The extracellular domain of alphal was subcloned from the original 
alphal clone in pLEN [Briesewitz et al, JBC 268:2989-2996 (1993)]. The 
extracellular domain of betal were subcloned from the full length betal clone 
He6. 1 .2/pcDNA-l Amp. This clone was obtained by screening a Hela cDNA library 
by hybridization. The leucine zipper sequences promote the formation of the 
alphalbetal heterodimer. These constructs were generated using the same leucine 
zipper sequences and vectors described in U.S. Patent No. 6,251,395, issued on June 
26, 2001, Example 14 of which is hereby incorporated herein by reference for its 
description of methods for constructing leucine zipper proteins. The alphal and 
betal leucine zipper constructs were co-transfected into CHO cells which were then 
maintained in DMEM/F12 media with 10% dialyzed FBS. Supematant was 
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collected and the secreted alphalbetal heterodimer was purified using 
chromatography over CNBr-activated Sepharose 4B (Pharmacia) coupled with an 
anti-leucine zipper monolconal antibody which recognizes both chains of the leucine 
zipper. For use in biochemical assays, purijfied alphalbetal leucine zipper protein 
5 was Europium labeled using a DELFIA Europium-labeling kit according to the 

manufacturer's suggested protocol. Binding of the labeled alphalbetal leucine 
zipper protein to immobilized collagen was measured by time resolved fluorescence. 
The heterodimer assay was set up essentially the same as the Europium labeled I 
domain assay, with the exception that Europium labeled heterodimer in CMF- 
10 PBS/lmM MgCl2/2% BSA was substituted as the probe for the Europium labeled I 

domain. 



Example 15 

Von Willebrand Factor/gpIb-CHO Static Cell Adhesion Assay 
1 5 The Al 1 domain in von Willebrand factor (vWf) is homologous to I 

domains found in other proteins. To investigate the possibility that these molecules 
might be susceptible to similar modulation as described above, the library of test 
compounds were tested for the ability to modulate vWf binding to gplb. 

Round-bottom (RB) glass plates were coated overnight at 4°C with 50 
20 |il of 1 |j,g/ml bovine vWf (bvWf) in CMF-PBS. Control wells include wells that 

were coated with 5 |ig/ml of vWf at 50 jil/well, or with fibrinogen at 10 |ag/ml. The 
next morning, the plate was washed once with 200 [il of CMF-PBS, blocked with 
200 \i\ of 2,5% gelatin for 1 hr at 37^C, and washed three times with 200 ^l 
CMF-PBS. 

25 CHO cells transfected with DNA encoding glycoprotein(GP) Ib-IX 

[Cranmer, J. Biol Chem, 27^:6097-6106(1999)] were grown in DMEM/F12 with 
10% FCS, antibiotics, glutamine and 5-hydroxytryptophan supplemented with 400 
fxg/ml G418 and 200 |ig/ml zeocin (Invitrogen). Confluent cells were washed once 
with CMF-PBS and incubated with warm Versene in incubator for 5 min. Cells were 

30 collected and resuspended in Tyrode's solution (Sigma) with 4 mM EDTA at a 

density of 2 x 10^ cells/ml. 
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The library of test compounds were diluted in Tyrode's/EDTA to 20 
^iM and 50 [il of the diluted compound was added to each well to a final 
concentration of 10 [iM. For the control, 1 fil 100%DMSO was added to 300 \i\ cell, 
with the final concentration of DMSO 0.3%. In a control with a known vWf 
5 inhibitor, aurin-tricarboxylic acid (ATA, Sigma) was dissolved in 100% DMSO to 20 

mM, diluted with Tyrode's /EDTA to 20 [iM, and 50 |il/well to final concentration of 
1 0 |iM was added. 

Cells were added to each well at a density of 10^ cells/well in 50 (xl 
and the plates rocked for 40 min at room temperature. The non-adherent cells were 
10 removed by aspiration, 200 [xl CMF-PBS was added, the plates vortexed and the 

buffer removed. Calcein was added (50 p-l/well of a 2 \xM stock) and the plates 
incubated at room temperature for 1 hr to label adherent cells. Fluorescence was 
measured on a Millipore CytoFluor 2350 fluorimeter to quantitate adherent cells. A 
number of compounds having IC50 values less than 20 |iM were identified. 

15 

Example 16 

CDl lb-Mediated Neutrophil Adhesion to Fibrinogen 

The adhesion assay described above for CD18/CD1 lb- (Mac-1)- 
mediated adhesion of HL-60 cells to ICAM-1 was carried out with the following 

20 modifications. Each well was coated overnight at 4'*C with 50 |il of glycophorin (10 

|ig/ml), fibrinogen (5 |xg/ml) or with anti-CD 18 monoclonal antibody (22F12C, 5 
|ig/ml) and anti-CDl lb monoclonal antibody (44AACB, 5 |ig/ml) in 50 mM 
bicarbonate buffer (pH 9.6). Plates were blocked with 1% human serum albumin and 
no blocking antibody was used. Neutrophils were isolated firom fi^esh heparin whole 

25 human blood by density gradient centrifugation and 100 \il of the cells (4x10^ 

cells/ml) in adhesion buffer was added to each well. Plates were incubated at ST'^C 
for 10 minutes. 

Over 1000 compounds were screened, and several had IC50 values 
ranging firom 1 \iM to 40 |aM. Nine compounds were found to have IC50 values 
30 below 10 \iM [Cmpd S, Cmpd R, Cmpd N, Cmpd O, Cmpd P, Cmpd Q, Cmpd L, 

Cmpd V, and Cmpd F, as set out in Table 2. After SAR efforts based on compound 
Cmpd S (which initially showed an IC50 of 1 |aM), inhibition potency for 
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compounds improved to less than 200 nM (for Cmpd AA and Cmpd AC) and several 
compounds showed complete inhibition at 20 [iM [compounds ranging from Cmpd Z 
to Cmpd AM]. With the exception of Cmpd Z all compounds selected for their 
ability to inhibit aEp^/E-cadherin also antagonize the other integrin, a^^j. These 
5 compounds also exhibited minimal inhibitory activity towards aLp2. ^<32^ 

Also, relative to aEpyE-cadherin, these compounds display limited selectivity (less 
than 2-fold) for a^^j <^vp3- 

Example 17 

10 Development of Inhibitors of Racl Guanine Nucleotide Exchange Reaction 

Rac proteins are not active when bound to GDP, but are activated by 
the exchange of GDP for GTP. The exchange of GDP for GTP in Rac proteins is 
catalyzed by guanine nucleotide exchange factors (GEFs) such as, Vavl and Tiaml 

15 [Aghazdeh et al. Cell 102:625 (2000); Worthylake et al. Nature 408:682 (2000)], 

Due to the importance of Rac proteins in the control of cell proliferation, antagonists 
of the Rac guanine nucleotide exchange reaction and, in particular, small molecules 
that interfere with the exchange of GDP for GTP of Racl in the presence of Tiaml, 
are of considerable interest for the methods and compositions of the present 

20 invention. 

A. Cloning and expression of Racl and Tiaml : 

Racl and the DH-PH domain of Tiaml were cloned using standard 
recombinant DNA procedures [Disbury et al, J. Biol. Chem. 264:16378 (1989)]. 
25 Racl was expressed in E, coli as a GST fusion protein using the vector pGEX2T in 

accordance with previously described methods [Self and Hall, Meth. Enzymol. 256:3 
(1995)]. Purified thrombin-cleaved Racl protein was used in the assay. 

The Tiaml DH-PH domain expressed as a fusion protein containing a 
carboxy terminal 6XHis tag using the plasmid pET28a described by Rossman and 
30 Campbell, Meth. EnzymoL 325:25 (2000). 
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B. Guanine nucleotide exchange assay : 

The Tiaml -catalyzed exchange of GDP for GTP of Racl was carried 
out essentially according to the procedure described by Crompton et al, J. 
BioLChem. 275(33):25751 (2000). GDP-bound Racl was incubated with [a^^P]- 
labeled GTP, in the presence of Tiaml and nucleotide exchange was monitored by 
following the increase in radioactivity bound to Racl . Free radioactivity was 
removed by placing the reaction mixture in the well of a 96-well plate and filtering 
out the fraction of [a^^PJGTP that is not bound to Racl . During the screen for the 
nucleotide exchange antagonists, the compounds were used at 10 [xM concentration. 
The compounds analyzed by the screening methods are further described below. 

C> Cell proliferation assay : 

Rat embryonic fibroblast (REF) and Jurkat cells were selected as 
representatives of fibroblastic and T cells respectively in order to test the effect of 
Racl guanine nucleotide exchange inhibitors on cell proliferation. REF cultured in 
the medium RPMI ("REF-R cell culture") was obtained as described by Nobes, 
Meth. Enzymol. 325:441 (2000). REF-R or Jurkat cells in complete RPMI and 10% 
fetal bovine serum (FBS) were plated into 96 well plates in duplicate, 10 x 10^ 
cells/well. After 21 hrs (for Jurkat), and 45 hrs (for REF-R), AlamarBlue (Serotec) 
was added, and cells were returned to the incubator (37°C, 5% CO2) for additional 3 
hrs. Results were obtained with SpectraMaxGEMINI (Molecular Devices). 

Compounds that inhibit the Racl guanine nucleotide exchange 
reaction by at least 50% of the control were obtained. The IC50 value of several of 
the compounds were determined for the guanine nucleotide exchange reaction of 
Racl, in the presence of Tiaml . The five structures shown in Table 8 represent the 
most potent inhibitors, and IC50 values for the guanine nucleotide exchange reaction 
for these compoimds are also included therein. 



129 



TABLE 8 



Compound 


IC50 
(txM) 


<^^^^ 

\ HO p HO p 


1.9 


AZ 


3.1 


AAA 

^ JI Jl 0 F 
F F 


3.5 
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Example 18 
Inhibition of Bacterial Proteins 

Three microbial enzymes containing Rossmann fold structures were 

identified as candidates for screening with the library of test compounds. Selection 

5 was based on (i) presence of the Rossmann structure; (ii) expression patterns in 

prokaryotic and eukaryotic cells; (iii) clinical importance; and (iv) functional 

importance to bacterial growth and survival. Two of the selected proteins, 

dihydrodipicolinate reductase (DHPR or DapB) and enoyl-acp reductase (ENR), 

catalyze electron transfer from NADH to a substrate and are integral to biosynthetic 

1 0 pathways for lysine synthesis and fatty acid synthesis, respectively. The third and 

fourth proteins, E, coli ra^-like GTPase (ERA-GTAse) and yihA (also a GTPase), are 
involved in translation and cell cycle regulation. 

Modulation of DapB activity is assessed using an optical assay that 
involves synthesis of dihydrodipicolate from aspartate semialdehyde. The assay 

15 utilizes dihydrodipicolate synthease (Dap A) to first synthesize dihydrodipicolinate, 

followed by addition of NADH and DapB. A coupled reaction is necessary because 
dihydrodipicolate is an unstable compound. The change in absorbance in the 
presence and absence of a test compound resulting from NADH conversion to NAD^ 
is monitored at 340 nm. 

20 Identification of modulators of ENR is carried out in a similar 

manner, but in a single step reaction. Briefly, NADH and ENR are first incubated, 
followed by addition of substrate, (either crotonyl-CoA or crotonyl-ACP). Again, the 
change in absorbance in the presence and absence of a test compound resulting from 
NADH conversion to NAD^ is monitored at 340 nm. 

25 Li view of the fact that optical assays require large amount of 

substrate, i.e., crotonyl Co A, and the fact that several test compounds absorb at the 
same wavelength as NADH, alternative thin layer chromatography (TLC) and plate- 
based assays were designed to identify modulators of ENR using radiolabeled 
NADH. 

30 The TCL method measures conversion of ^^P-NADH to NAD^ in the 

presence of lithium chloride which causes the two sates to separate on PEI 
membranes after a 5 to 10 min run time. Radiolabeled spots are measured on a 
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Storm Phosphoimager and the ratio of NAD^ to NADH is calculated. An increase in 
the ratio of NADH to NAD^ in the presence of a test compound is indicative of 
inhibition of the conversion. The control reaction is optimized to measure 
conversion in the linear range. Practical application of this assay was demonstrated 
5 using a commercially available enzyme inhibitor. The TLC method is particularly 

useful for small scale screening. 

For large scale screening, the plate based assay is designed to utilize 
the same reagents. This assay exploits the charge difference between NADH and 
NAD"" to permit separation. Positively-charged DEAE-cellulose membrane is used 

1 0 to selectively trap ^^P-NADH which has a net negative charge greater than NAD^. 

Trapped NADH is detected using scintillation counting and increased signal in the 
presence of a test compound indicates enzyme inhibition. 

For ERA-GTPase, a one step assay is carried out to identify 
modulators. The transfer of labeled phosphorus in the conversion of GTP to GDP is 

15 measured in the presence and absence of a test compound, the label being detected in 

a scintillation counter using an assay routinely practiced in the art. Conditions for the 
ERA GTPase assay can also be utilized in screening for yihA modulators. 



Example 19 

20 Identification of HPPK Antagonists 

The enzyme 6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase 
(HPPK) is part of the de novo folate biosynthetic cascade and catalyzes the transfer 
of pyrophosphate from ATP to 6-hydroxy-7,8-dihydropterin (HMDP) [Richey et aL, 
J. Biol. Chem. 244:1582-1592 (1969)]. HPPK is expressed in both gram positive and 
25 gram negative bacteria, fungi, and protozoa, but not in higher eukaryotes. 

Accordingly, HPPK represents a novel target for the development of antibiotics with 
anti-folate activity. 

1. Isolation of the E. coli HPPK Gene 
30 The E, coli HPPK gene v^as isolated by PGR ampUfication ofE. coli 

genomic DNA with the following oligonucleotide primers specific for the 5* (SEQ ID 
NO: 28) and 3' (SEQ ID NO: 29) ends of the HPPK gene: 
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5'EcHPPK 5^-GTAGATGACAGTGGCGTATATT-3' SEQ ID NO: 28 

3'EcHPPK 5'-GCCTTACCATTTGTTTAATTTGT-3' SEQ ID NO: 29 



PGR was performed in a Perkin Elmer Cetus (PE Applied Biosystems, Foster City, 
5 CA) DNA thermal cycler under standard conditions. See generally, Ausubel et. al. 

Current Protocols in Molecular Biology, Vol. 3, p. 15.1.1 - p. 15. 1.15 (1999). The 
amplification products were then analyzed by agarose gel electrophoresis to 
determine the approximate size of the PGR product, and a single DNA fragment of 
approximately 487 bp was detected, as anticipated. 

1 0 The HPPK PGR product was ligated to the vector pCRH-TOPO 

(Invitrogen Corp., Carlsbad, CA) according to the manufacturer's protocols. E, coli 
strain TOP 10 (Invitrogen Corp., Carlsbad, CA) was transformed with an ahquot of 
the ligation reaction, as recommended by the manufacturer, and single bacterial 
colonies were isolated and grown overnight in LBM media containing 100 jig/ml 

15 carbenicillin. 

Plasmid DNA was isolated from 2 ml cultures of the single colonies 
using the Wizard Plus Miniprep Kit (Promega Corp., Madison, WI). The DNA 
sequence of the E, coli HPPK PGR product in the plasmid pCRn-TOPO/EcHPPK 
was determined to be correct, having the amino acid sequence set out in SEQ ID NO: 
20 30. 



2. Generation of His(6)-HPPK Expression Constructs 

In order to facilitate the purification and detection of coli HPPK, 
the following changes were made to the E. coli HPPK coding sequence during a 

25 subsequent PGR amplification: 1 .) the amino terminus of HPPK was modified to 

incorporate an additional 6 histidine residues, and 2.) unique restriction sites were 
added to the 5' and 3' ends of the coding region to facilitate subcloning of the PGR 
fragment into the expression vector pBAR5. Methods for the subcloning of a similar 
PGR fragment into an expression vector have been previously described in U.S. 

30 Patent No. 5,847,088, issued December 8, 1998, Example 8 of which is hereby 

incorporated herein by reference. The 5' PGR primer included an Ncol restriction 
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site followed by sequences encoding the additional amino acid residues 
"MGHHHHHHGG" (SEQ ID NO. 31) as shown below: 

5'EcHisHPPK SEQ ID NO: 32 

5 5'-CGCCATGGGCCACCACCACCACCACCACGGCGGCATGACAGTGGCGTA 
TATT-3' 

The 3' PGR primer included a Xhol restriction site and is shown below: 

1 0 3'EcXhoHPPK SEQ ID NO: 33 

5'-CGGCTCGAGTTACCATTTGTTTAATTTGT-3' 

Using these primers, the 487 bp HPPK PGR product was ampUfied in a standard 
PGR amplification reaction, and an aliquot of the reaction was analyzed by agarose 

1 5 gel electrophoresis. A single band of approximately 5 1 9 bp that corresponded to the 

anticipated size was detected. The PGR amplification product was purified using a 
QIAquick PGR Purification Kit (QIAGEN Inc., Valencia, CA), and digested with the 
restriction enzymes Ncol and Xhol. The digested PGR product was Ugated into 
Ncol- and Xhol-digested plasmid pBAR5, and an aliquot of the ligation reaction was 

20 used to transform TOP 10 bacteria according to the manufacturer's protocols 

(Invitrogen Corp., Carlsbad, CA). Single colonies were isolated after plating on 
LBM agar plates containing carbenicillin. Several of the single colonies were grown 
overnight in 2 ml cultures of LBM containing carbenicillin, and plasmid DNA was 
isolated for DNA sequencing as previously described. The plasmid 

25 pBAR5/HisHPPK was shown to contain an open reading fi-ame encoding the 

His(6)-HPPK gene having the following amino acid sequence set out in SEQ ID NO: 
34. 

3. HisHPPK Expression 

30 Plasmid pBAR5/HisHPPK was used to transform the E. coli strain 

BL2 1 (DE3)pLysS (Novagen Inc., Madison, WI) using standard methods. 
Transformants were selected after plating onto LBM plates containing both 
chloramphenicol and carbenicillin, to select for the plasmids pLysS and 
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pBAR5/HisHPPK respectively. Plasmid pLys is a plasmid that encodes T7 
lysozome. The presence of lysozome aids cell lysis following a freeze-thaw cycle. 

To initiate large-scale expression of HisHPPK, a 50 ml culture of 
BL2 1 (DE3)pLysS containing pB AR5/HisHPPK was grown overnight at 30°C with 
5 shaking in LBM containing carbenicillin and chloramphenicol. The following day, 

10 ml of the overnight culture was used to inoculate 2 liter flasks containing 500 ml 
of LBM supplemented with carbenicillin and chloramphenicol. The flasks were 
incubated at 37°C with shaking until the bacterial cultures reached an OD^qq of 
approximately 0.6. 

1 0 The plasmid pBAR5/HisHPPK contains an arabinose-inducible 

promoter upstream of the HisHPPK gene. Once the cultures reached appropriate 
density, arabinose was added to the cultxures to a final concentration of 0.1% to 
induce HisHPPK expression, and the flasks were incubated at 37°C with shaking for 
another 2.5 hours. The bacteria were then harvested by centrifugation and the cell 

15 pellet from 1 liter of bacterial culture was resuspended in lysis buffer [50 mM 

Na2HP04, pH 8, 50 mM imidazole, 10 mM P-mercaptoethanol, 0.5 M NaCl, and 
EDTA-free protease inhibitor cocktail tablets (Roche Molecular Biochemicals, 
Lidianapolis, IN)] to a final volume of 35 ml. Each 35 ml bacterial suspension was 
transferred to a 50 ml polypropylene tube, snap frozen on dry ice, and then stored at 

20 -20°C. 



4. Purification of HisHPPK 

Each 35 ml aliquot was thawed on ice and lysed in a French press. To 
obtain a cleared lysate representing the soluble protein fraction, the lysate was 
25 centrifuged for 30 minutes at 20,000 x g, at 4°C. HisHPPK was purified using a 

two-step procedure. 

First, bacterial proteins that bound the Ni-NTA agarose (QIAGEN 
Inc., Valencia, CA) nonspecifically were removed by incubating the cleared lysate 
with NTA agarose which had been previously treated with EDTA to remove 
30 associated Ni^^cations. The 35 ml of cleared lysate was incubated batchwise with 1 

ml of NTA agarose for approximately 1 hour at A'^C after which the NTA resin was 
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removed by centrifugation. HisHPPK was purified on Ni-NTA agarose according to 
the manufacturer's protocols (QIAGEN Inc., Valencia, CA). The isolated HisHPPK 
protein was resolved on a 12% Novex gel (Invitrogen Corp., Carlsbad, CA), the gel 
was fixed and stained with Coomassie brilliant blue under standard conditions, and 
5 the only protein identified in the HisHPPK preparation was a single species of about 

19 kD in mass, which corresponds to the anticipated size of HisHPPK. The protein 
was dialyzed against 20 mM Tris, pH 8, aliquotted, and stored at -70°C. 

5. Screening Assay for HPPK Activity 

10 hi order to identify small molecule inhibitors of HPPK, an assay for 

HPPK measuring the HPPK-dependent conversion of ATP to AMP as a by-product 
of the pyrophosphorylation of 6-hydroxymethyl-7,8-dihydropterin (HMDP) was 
employed [Shi et al J, Med, Chem, 44:1364-1371 (2001)]. Elevated concentrations 
of both substrates (HMDP and ATP) were used in the assay to reduce the possibility 

1 5 of identifying substrate competitors. This reaction was modified for use in 96-well 

V-bottom polypropylene plates as follows 

A master mix of the following composition was prepared containing 
50 mM Hepes, pH 8.5, 100 ^M HMDP (Schircks Laboratories, Jona, Switzerland), 
10 mM MgCl2, 35 jiM adenosine triphosphate, and 10 ng of y-labeled ^^P-ATP 

20 (Amersham Pharmacia Biotech, Arlington Heights, IL). An aliquot of the master 

mix was added to each well of the 96-well assay plate. Also added to the assay plate 
was 5 ^1/well of the candidate inhibitor compound at a final screening concentration 
of 20 (iM. Each candidate compound was diluted in DMSO prior to addition to the 
assay plate; and the final concentration of DMSO in the final assay mixture was 5%. 

25 The reaction was initiated by the addition of 100 ng of purified HisHPPK, and 

allowed to proceed for 1 5 minutes at 37^C. The reaction was stopped by the addition 
of an equal volume of 120 mM EDTA to each well. To resolve radiolabeled ATP 
from AMP, 2 \i\ of the reaction volume was spotted onto a PEI cellulose plate and 
the plate was developed with 0.3 M KH2PO4. The radioactivity of the plate was 

30 measured with a system Molecular Dynamics Storm 860 Phosphor imager system 

(Molecular Dynamics Storm, Sunnyvale, CA). The HPPK enzymatic activity in the 
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presence of compound was inferred from the percent conversion of radiolabeled ATP 
to AMP in duplicate test samples relative to a DMSO-only control reaction. Since 
nonspecific background in samples lacking substrate was less that 1%, and no 
correction was made. Approximately 2,520 compounds were screened, and 
approximately 58 compounds inhibited HPPK activity by 55% or greater, yielding a 
hit rate of 2.3%. These compounds were ranked for in vitro potency by IC50 
determinations. 

6. Effect of HPPK Antagonists on the Bacterial Growth ofE. coli TolC 

The minimal inhibitory concentration (MIC) required to inhibit the 
growth of E, coli, using a microtiter broth assay, was measured in order to determine 
the in vivo activity of the HPPK hits. The MIC is defined as the minimum 
concentration required to reduce growth 80% compared to DMSO-only controls. 
More specifically^ the efficacy of these compounds was measured against an E. coli 
strain containing a mutation in the TolC gene. The TolC gene encodes a 
transperiplasmic efflux pump which facilitates the export of small molecules such as 
protein toxins and antibiotics fi-om the bacterial cytosol [Andersen et al Curr. Opin. 
Cell. Biol. 13:412-416 (2001).] Although, this mutation has no affect on the entry of 
compounds into the bacterium, some compounds prone to elimination via the efflux 
pumps may reach a higher intracellular concentration in the TolC mutant. All 
microtiter broth assays followed those protocols established by the National 
Committee for Clinical Laboratory Standards [Methods for Dilution Antimicrobial 
Susceptibility Tests for Bacteria that Grow Aerobically; approved standard-5th 
Edition, Vol. 20, No.2. NCCLS Guidelines. Wayne, Pennsylvania (2000).] 

Microtiter broth assays were performed in Mueller-Hinton broth, 
which contains low thymidine levels. The presence of thymidine in bacterial media 
antagonizes the activity of the anti-folates trimethoprim and sulfamethoxazole, and 
likely antagonizes HPPK inhibitors as well. 

Compounds were serially diluted two-fold in DMSO prior to addition 
to the microdilution plates. Each plate contained two controls: a serial dilution of 
trimethoprim provided a positive control for each plate, and a second row containing 
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uninoculated Mueller-Hindon broth served as a sterility control for monitoring 
cross-contamination between wells. The inoculum density was approximately 10^ 
bacteria/ml in a final volume of 100 \xl Plates were incubated for 16 hours before 

OD^oo measured. 

The four compounds with the greatest activity in the MIC assays are 
shown in Table 9. The minimal inhibitory concentration of these compounds in E, 
coli TolC ranged from 0.1-12.5 [iM. However, the MIC assays do not distinguish 
between bacteriostatic and bacteriocidal modes of action, nor do they determine if 
these compounds selectively inhibit HPPK in vivo. Experiments are underway to 
determine if these compounds have anti-folate activity and inhibit HPPK in vivo. It 
is well established that the activity of conventional anti-folates such trimethoprim 
and sulfamethoxazole are antagonized by the presence of thymidine in the bacterial 
medium [Amyes and Smith, J. Med. Microbiol. 7(2):143-153 (1973)]. Experiments 
to determine the MIC for each compound in Mueller-Hinton media alone, or 
following supplementation of the media with thymidine will be conducted. If the 
diarylsulfide compounds inhibit HPPK in vivo, then their activity should be 
attenuated in the presence of the folate end-product thymidine. Alternatively, these 
compounds can be analyzed for their abiUty to synergistically inhibit bacterial growth 
when paired with trimethoprim. Synergism would only occur if both the 
diarylsulfide compoimd and trimethoprim were acting on the same biochemical 
pathway. The combinatorial analysis of trimethoprim and a test compound are 
performed in a standard "checkerboard" study where these compounds are 
cross-titrated and analyzed for their effect on bacterial growth in a microtiter broth 
assay as previously described [Ehopoulos and Moellering, Jr., Antimicrobial 
Combinations, pp. 330-393, in Antibiotics in Laboratory Medicine, 4th Edition.(V. 
Lorianed., 1996)]. 
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TABLE 9 



AAD 



AAE 



AAF 
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AAG 
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Example 20 
Assays for the Identification of ftsZ Inhibitors 

FtsZ is the product of an essential bacterial gene that is involved in 

cell division. FtsZ binds and hydrolyzes GTP, and when bound to GTP it forms 

5 long, linear polymers. The GTP-dependent polymerization of ftsZ is related to its 

function in bacterial cell division. During septation, ftsZ forms a ring to define the 

plane of cell division. Cells lacking ftsZ can not undergo septation, do not divide 

and die. FtsZ is highly conserved (approximately 60%) throughout the bacterial 

kingdom. Accordingly, ftsZ inhibitors could represent broad-spectrum antibiotics 

1 0 with a novel mechanism of action. The atomic structure of ftsZ, as determined by x- 

ray diffraction, shows that it is an alpha/beta protein [Nogales et ah, (1998) Nature 
Structural Biology 5:451-458]. The most similar structural relative to ftsZ is the 
eukaryotic protein tubulin, which is a GTP-binding and hydrolyzing protein that also 
polymerizes to form microtubules with an essential role in the segregation of 

15 organelles and chromosomes during cell division. 

A polymerization assay for the identification of ftsZ inhibitors that 
can be performed in microtiter wells has been devised. The polymerization assay is 
an adaptation of a tubulin polymerization assay [BoUag et al, Cancer Research 
55:2325-2333 (1995)], and involves the reversible polymerization of ftsZ in a 

20 GTP-dependent fashion. 

In the presence of GTP and lOmM CaCl2, 5 nm ftsZ linear polymers 
assemble into higher order polymers [Yu et al, EMBO 16:5455-5463 (1997)] that 
are large enough to be trapped by a 0.2!Lim filter. The protein that is retained on the 
filter can be stained and detected in a colorimetric assay. A reaction consisting of 

25 300 p-g/ml of ftsZ polymerized by 1 OOfiM GTP was screened against candidate flsZ 

inhibitors at lO^iM. 

An alternative assay that may be more sensitive was also devised. In 
this assay, 100|iig/ml ftsZ was incubated with 0.5 \xM ^^P-y-GTP. The GTPase 
activity of ftsZ liberates ^^P04. By terminating the reaction with 25 mg/ml activated 

30 charcoal in lOOmM NaH2P04 and centrifaging the product, the remaining -^^P-y-OTP 

is trapped by the charcoal. Accordingly, the ^^P04 that remains in the supernatant 
can be measured, providing a measurement of GTPase inhibition. This screening 
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assay may better identify ftsZ inhibitors because it is significantly more sensitive to 
inhibition by GDP than the polymerization screen described above (IC50 of 8 fiM vs. 
250 [iU). 



5 Example 21 

Screening Assay for ENR Inhibitors 

An assay to screen for ENR inhibitors, using non-radioactive high 

purity NADH, was developed. The isolation of ENR is described by Baldock et al.^ 

Science 274:2107 (1996). Briefly, ENR catalyzes the conversion of NADH and 

10 crotonyl-CoA to form NAD+ and fatty acyl-CoA, and the assay measures the amount 

of NAD+ produced in a second reaction wherein luciferase converts NAD+ to 
NADH. Light emission from the luciferase reaction is proportional to the amount of 
NAD+ produced in the initial reaction. A candidate inhibitor compound is added to 
the ENR reaction, and if the candidate inhibits ENR activity, the amount of light 

1 5 detected in the luciferase reaction is decreased. 

The assay was carried out as follows. Twenty |il of 30 jxM NADH 
(Boehringer Manneheim) in 20mM Hepes containing 6 ng/|il ENR or a total of 120 
ng per well and 20 }il of 10 ^iM candidate inhibitor compoimd in DMSO were added 
to a 96 well flat bottom optical plate. Twenty p,l of 300 jiM crotonyl-CoA (Sigma, 

20 C6146) was subsequently added to initiate the reaction. Triclosan was used as a 

control inhibitor and was included on each plate to verify inhibition. M the screening 
assay, triclosan inhibits with an IC50 of about 1 |iM. 

The reaction was allowed to continue for approximately ten minutes, 
corresponding to about 30 percent of the way to completion. Accordingly, the 

25 concentration of NAD+ should be approximately 3 p.M after ten minutes. 

Thirty p,l of 160mM HCl was added to the system to bring the pH of 
the reaction mixture below 2 and remove remaining NADH substrate. The reaction 
mixture was incubated for one minute following acid addition so that the remaining 
NADH decomposes to ADP-ribose and nicotinamide, NAD+ is substantially 

30 unaffected by the addition of strong acid. 

After the one minute period referenced immediately above, 1 10 |xl of 
a NADH regeneration/luciferase solution comprising alcohol dehydrogenase. 
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ethanoL FMN, FMN oxidoreductase, decanal and bacterial luciferase was added to 
the reaction mixture. 

More specifically, the NADH regeneration/luciferase solution was 
prepared in 1 10 iil of a buffer solution containing 300mM Tris (using a stock 1 M, 
5 pH 7.5 solution), 0.26 % by weight bis(trimethylsilyl)acetamide ("BMA"), 0.65 mM 

EDTA, and 18 mM KCL To this solution, 0.67 ^1 of decanal (Sigma, D7384, 98% 
purity) was added for every 1 0 ml of solution to yield a final solution having decanal 
concentration of approximately 200 ^iM. Sufficient FMN (Sigma F8399) was added 
to provide a final solution having a FMN concentration of approximately 2 jiM. 

10 Similarly, sufficient ethanol (200 proof) was added such that the final solution has an 

ethanol concentration of approximately 1 00 [xM. After adding all of these reagents to 
the solution, it was vortexed vigorously. 

To this solution, 1,08 |il of NADH:FMN oxidoreductase (Roche, 476 
480) was added for each 10 ml of solution, to yield a final solution having a 

15 concentration of 1.25 units per liter. Bacterial luciferase (Roche, 476 498) was added 

to yield a solution having approximately 4.5 |ig/mL Similarly, alcohol 
dehydrogenase was added to provide a solution having a final concentration of 0.7 
units per ml. After adding these reagents, the mixture was mixed gently by 
inversion. 

20 Approximately 1 00 compounds that inhibit ENR activity were 

identified in this assay. About 50 of these compounds exhibited significant 
inhibitory activity in a radiomimetric ENR assay. In this assay, twenty \il of 30 |iM 
^^P-NADH in 20mM Hepes was incubated with 120 ng of ENR per well and 20 jul 
of 10 ]LiM candidate inhibitor compound in DMSO. Twenty \xl of 300 \iM crotonyl- 

25 Co A (Sigma, C6146) was subsequently added to initiate the reaction. The products 

of the reaction include 32P-NAD. The reactant 32P-NADH and the product 32P- 
NAD are separated from each other by thin layer chromatography on PEI-cellulose in 
IM LiCl, and visualized by autoradiography. The extent of the reaction is 
determined by the conversion of NADH to NAD. These compounds were further 

30 tested for inhibition of E. coli growth. 
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A permeable bacterial strain (AB734 TN10::tolC) was used in the 
screening method to maximize the ability of the compounds to cross the gram 
negative cell wall. Assays were conducted in accordance with the NCCLS protocols 
referenced herein. 

5 It was of further interest to determine whether those compounds with 

antimicrobial activity worked in a ENR-dependent fashion. Two strains of the 
permeable toIC strain were constructed. In the first strain, the ENR protein was 
overexpressed by placing it under control of its own promoter on a moderate copy 
number plasmid, and the second strain served as a control including only the 

10 plasmid. A candidate compound that targets ENR should be much less active against 

the first strain described above because the target is substantially overexpressed. For 
example, the MIC for triclosan shifts fi*om 31 to 1000 ng/ml when tested against the 
first strain. Similarly, compound 325084 had a shift in MIC fi-om 25 [iM to greater 
than 100 |xM, suggesting that this compound exerts its antimicrobial action by virtue 

15 of inhibiting ENR during bacterial growth. The results do not distinguish between 

the possibilities that compound 325084 and triclosan act on ENR at the same or 
distinct sites on the enzyme. However, because no other compound showed a similar 
shift in MIC, it is believed that these other compounds probably inhibit bacterial 
growth through a different mechanism. Nonetheless, compounds 325085 and 

20 325086 have structures similar to compound 325084, and also demonstrated some 
activity against ENR, 

In order to determine if compound 325084 and triclosan act at the 
same or different ENR sites, recombinant ENR is produced which (i) includes a 
glycine to valine substitution at residue 93, (ii) retains enzymatic activity, and (iii) is 

25 insensitive to triclosan. Compounds that are identified as ENR inhibitors are then 

assayed using both wild type and mutant ENR and compounds that show little or no 
inhibitory activity against the mutant ENR form are probably acting at the active site 
of ENR and may be discarded. Alternatively, compoimds which inhibit the mutant 
enzyme and the wild type form may be acting at an allosteric site and will be studied 

30 further. 
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While the present invention has been described in terms of specific 
embodiments, it is understood that variations and modifications will occur to those 
skilled in the art. For example, with respect to the compounds disclosed herein, it 
should be understood that the substitution of one halogen substituent for another, 
different halogen substituent is within the scope of the present invention. 
Accordingly, only such limitations as appear in the appended claims should be placed 
on the present invention. 
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